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0. Foreword 

This is a survey paper on the subject of Strong Uniqueness in approximation theory. The concept 
of strong uniqueness was introduced by Newman, Shapiro in 1963. They proved, among other 
things, that if M is a finite-dimensional real Haar space in C(B), B a compact Hausdorff space, 
and if m* is the best approximant to / from M, then there exists a 7 > such that 
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for all m £ M, where 7 may depend upon /, M and m* , but is independent of the specific m. An 
inequality of the above form valid for all m £ M is what we call classical strong uniqueness. This 
property is stronger than the uniqueness of the best approximant (hence the name). 

Strong uniqueness has been much studied, mainly during the 1970s, 1980s and 1990s, and there 
have been over 100 research papers devoted to the subject. We thought this might be a good time 
to reflect upon what was considered and accomplished in the study of this subject. We hope that 
you will agree that the resulting theory is not insignificant. 

This paper is organized as follows. We have divided the survey into three parts. The first part 
is concerned with what we termed above classical strong uniqueness, i.e., inequality (0.1). In Section 
1, we exactly characterize the optimal strong uniqueness constant (largest possible 7 in (0.1)) via 
the one-sided Gateaux derivative. In Section 2, we consider some general results regarding classical 
strong uniqueness in the uniform norm. We look in detail at the case where the approximating 
set is a finite-dimensional subspace, give a characterization of when we have strong uniqueness, 
and also prove that in this setting the set of functions with a strongly unique best approximant is 
dense in the set of functions with a unique best approximant. Finally, we provide a general upper 
bound on the optimal strong uniqueness constant based on projection constants. In Section 3, 
we consider the relationship between local Lipschitz continuity of the best approximation operator 
at a point and classical strong uniqueness at the same point. A general result is that the latter 
always implies the former. We prove that the converse holds when approximating from a finite- 
dimensional subspace in the uniform norm. In Section 4, we restrict our attention to approximation 
from finite-dimensional Haar spaces in the uniform norm. We first prove the result of Newman, 
Shapiro [1963] mentioned above, i.e., that in this case, we always have strong uniqueness. We then 
consider various properties of the optimal strong uniqueness constant with respect to the function 
being approximated. We prove, for example, that the optimal strong uniqueness constant is upper 
semi-continuous but not necessarily continuous, and it is not uniformly bounded below by a positive 
constant (i.e., bounded away from 0) if the subspace is of dimension at least 2 and the underlying 
domain is not discrete. We find lower bounds (that sometimes provide equality) for the optimal 
strong uniqueness constant via specific elements of the approximating subspace. Assuming the 
subspace is of dimension n and the error function only attains its norm at n + 1 points (the generic 
case), we obtain that the optimal strong uniqueness constant is bounded above by 1/n. We also 
give another characterization of the optimal strong uniqueness constant and consider the question 
of when strong uniqueness and uniqueness are equivalent concepts. In Section 5, we look at classical 
strong uniqueness in the L 1 norm. Among other results, we prove that if v is a non-atomic positive 
measure then the set of functions in L 1 (K, v) that have a strongly unique best approximant from 
any finite-dimensional subspace is dense in L l (K, u). In addition, under the above assumptions, we 
show, as in the uniform norm, that a function in L (K, v) has a strongly unique best approximant 
if and only if the best approximation operator from the same finite-dimensional subspace is locally 
Lipschitz continuous. Furthermore, in this case, we get explicit upper and lower bounds for the 
optimal strong uniqueness constant based on the Lipschitz continuity of the best approximation 
operator. We also present some results on classical strong uniqueness in the problem of one-sided 
L 1 approximation. In Section 6, we consider approximation by rational functions of the form 

R m ,n ■= {r = p/q : p £ U m , q £ U n ,q(x) > 0, x £ [a, b}}, 

where II n = span{l,x, . . . ,x n }. The main result reported on is that we have the equivalence of 
strong uniqueness to a function / from R m ,n, the operator of best rational approximation from 
Rm,n being continuous at /, and the fact that the unique best rational approximant to / from 
R m ,n is n °t contained in i? m _ 1)Jl _i. 
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In the second part of this survey, we discuss what we call non-classical strong uniqueness. By 
this, we mean the existence of a nonnegative strictly increasing function <j) defined on R + , and a 
constant 7 > that may depend upon / and M (and thus on m*), for which 



for some a > (a local estimate). We start in Section 7 with the case of a uniformly convex norm 
and prove the basic inequality 



valid for all m G M where m* G M is a best approximant to /. Here 5 is the usual modulus of 
convexity of the uniformly convex space. We apply this result and consider also variants thereof. In 
Section 8, we return to a consideration of the uniform norm. The main result we report on therein 
is that non-classical strong uniqueness holds for (j){t) = t 2 if M C C 2 [a, b] is a finite-dimensional 
unicity space (and not necessarily a Haar space) with respect to C 2 [a, b]. In Section 9, we return to a 
consideration of the L -norm and obtain two local non-classical strong uniqueness estimates. In one 
case, we prove that if M is a finite-dimensional unicity space for all / G C[a, b] in the L 1 ^, b] norm 
then we have non-classical strong uniqueness with 4>(t) = tu~ 1 (f; Dt) for some constant D > 
that depends only on / and M. Here u is the standard modulus of continuity. See Theorem 9.2 for 
a more detailed explanation. A second result concerns the one-sided L 1 approximation problem. 
We prove that if M C C 1 [a, b] is a finite-dimensional unicity subspace for C 1 [a, b] in the one-sided 
L 1 -norm, then we have non-classical strong uniqueness with <p(t) = tHJ 1 (Dt) for some constant 
D > that depends only on / and M, where H is based on the moduli of continuity of /' and m' 
for all m G M. In Section 10, we consider strong uniqueness when we approximate complex-valued 
functions in the uniform norm. The main result therein is non-classical strong uniqueness with 
4>(t) = t 2 . We also discuss conditions under which classical strong uniqueness holds. 

The third part of this survey concerns various applications of strong uniqueness results. The 
main idea behind these applications is that instead of solving a best approximation problem in a 
given norm, we replace it by considering another norm, close to the original norm, one that leads to 
a simpler approximation problem. Strong uniqueness is then applied in order to show that the best 
approximant in this new norm is sufficiently close to the original best approximant. An estimation 
of the "closeness" is given in Section 11. Typically, the original norm is modified by replacing it by 
a similar discrete norm (as considered in Section 12), or by introducing a weight function into the 
norm. In Section 13, we use strong uniqueness results in order to solve approximation problems in 
the case when the norm is altered by a weight function. This approach is used to derive asymptotic 
representations for weighted Chebyshev polynomials. 

We have tried to make this survey as self-contained as seemed reasonable. As such there are 
many additional peripheral results included in this paper that, we hope, put the strong uniqueness 
results into a reasonable context. 

On the other hand, while this survey is lengthy, it is far from complete. Certain specific 
problems and general topics within the area of strong uniqueness have not been surveyed. For 
example, a lot of effort and many papers have been concerned with the study of 7 n (/), the optimal 



||/-m||-||/-m*|| >7<K||m-m*||) 



for all m G M (a global estimate), or for all m G M such that 
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strong uniqueness constant (in (0.1)) when approximating / G C[a, b] in the uniform norm from 
Tl n = spanjl, x, . . . , x n }. Many different questions were asked concerning this important case. 
Nevertheless, we have not considered this class of problems. Thus, as is noted in Section 4, Poreda 
[1976] raised the question of describing the asymptotic behaviour of the above sequence (7n(/)) as 
n tends to oo for a given function /. Henry and Roulier [1978] conjectured that 

liminf 7 n (/) = 

n— >oo 

if / is not a polynomial. This conjecture, sometimes called the Poreda conjecture, was proved for 
various classes of functions over a string of papers. It was finally solved in the positive by Gehlen 
[1999]. It should be noted that the above liminf cannot be replaced by the simple limit. There are 
functions / G C[a, b] for which 

liminf 7„(/) = 0, limsup7 n (/) = 1; 

n->oo n->oo 

see Schmidt [1978]. These same 7 n (/) were also studied for specific functions /, or specific classes 
of function; see for example Henry, Huff [1979], Henry, Swetits [1981], Henry, Swetits, Weinstein 
[1981], Henry, Swetits [1982] and Henry, Swetits, Weinstein [1983]. There are also papers devoted 
to looking for sets in C[a, b] over which the strong uniqueness constants are uniformly bounded 
below by a positive constant; see for example Bartelt, Swetits [1983], Marinov [1983], and Bartelt, 
Swetits [1988], as well as papers devoted to looking at the optimal strong uniqueness constant as 
a function of the domain; see for example Henry, Roulier [1977], Bartelt, Henry [1980], and Paur, 
Roulier [1981]. For a study of strong uniqueness when approximating with constraints, leading to 
non-classical strong uniqueness, see for example Fletcher, Roulier [1979], Schmidt [1979], Chalmers, 
Metcalf, Taylor [1983] and Kroo, Schmidt [1991]. All the above is with regards to approximation 
from n n . In addition, strong uniqueness when approximating by splines, with either fixed or 
variable knots, was considered in Niirnberger, Singer [1982], Niirnberger [1982/83], Niirnberger 
[1994], Sommer, Strauss [1993], Zeilfelder [1999] and Zwick [1987]. Again this is not discussed in 
what follows. 

We have tried to give exact references, and have also endeavored to make the list of references 
complete. This list contains all references we have found on the subject of strong uniqueness. Note 
that not all papers on the list of references are referred to in the body of this survey. We apologize 
for any omissions and would appreciate information with regard to any additional references. 



Part I. Classical Strong Uniqueness 



1. Introduction 

Let A be a normed linear space and M a subset of A. For any given / G A, we let Pm(/) denote 
the set of best approximants to / from M. That is, m* G Pm(/) if m* G M and 

11/ -ml < 11/ -m|| 

for all m G M. This set Pm(/) may be empty (non-existence), a singleton (unicity) or larger. In 
this section, we consider conditions on a linear subspace or convex subset M of A under which we 
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obtain Classical Strong Uniqueness. By this, we mean the existence of a strictly positive constant 
7 > for which 

||/ — m\\ — ||/ — m*|| > j\\m — m*\\ 

for all m G M where m* G -Pm(/). The constant 7 > can and will depend upon /, m* and M, 
but must be independent of m. Note that the form of this inequality is the best one might expect 
since, by the triangle inequality, we always have 

||/ — m\\ — ||/ — m*\\ < \\m — m*\\. 

We first exactly characterize the optimal (largest) 7 for which such an inequality holds. This 
will be done via the one-sided Gateaux derivative that is defined as follows. Given f,g G X, set 



\ r \\f + tg\\ n, u / . 1 

T+(f,g)= km . (1.1) 

t->0+ t 

The value T + (f,g) is termed the one-sided Gateaux derivative of the norm at / in the direction g. 
The first thing we prove is that this one-sided Gateaux derivative always exists. 

Proposition 1.1. The one-sided Gateaux derivative of the norm at f in the direction g, namely 
T+(f,g), exists for every f,g G X. 

Proof: Set 

11/ + tell - 



rit) , 

We claim that r(t) is a non-decreasing function of t on (0, 00) and is bounded below thereon. As 
this is valid then r+ (/,<?) necessarily exists. 

To see that r(t) is bounded below on (0, 00), note that 

\\f + tg\\>\\f\\-\\tg\\ = \\f\\-t\\g\\. 

Thus for t > 0, we have r(t) > —\\g\\. The non-decreasing property of r can be shown as follows. 
Let < s < t. Then 



4f + sg\\ = \\tf + tsg\\ = |K/ + tg) + (t - < + tg\\ + (t - s) 

Thus 

t(\\f + sg\\-\\f\\)<s(\\f + tg\\- 

and that implies r(s) < r(t). □ 

Using this functional r + , it is now a simple matter to characterize best approximants from 
linear subspaces. Namely, 

Theorem 1.2. Let M be a linear subspace ofX. Then m* G Pm (/) if and only if r+ (f—m* , m) > 
for all m G M. 

Proof: (=>) Assume m* is a best approximant to / from M. Therefore 

\\f-m*+tm\\ > ||/-m*|| 
for every m G M and t > 0, immediately implying that r+(/ — m*,m) > 0. 

(4=) Assume T+(f — m*,m) > for all m G M. As M is a linear subspace, this implies that 
t_|_(/ — m*,m* — m) > 0. Prom the proof of Proposition 1.1, r(t) is a non-decreasing function of t 
on (0, 00). Setting t = 1 therein with respect to r + (/ — m*,m* — m), we obtain 

||/ — m* + m* — m\\ — \\f — m*\\ > r + (/ — m* ,m* — m) > 0. 

Thus ||/ — m\\ > ||/ — m*\\ for all m G M. □ 
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If M is a convex subset of X, then the same arguments prove the following. 

Corollary 1.3. Let M be a convex subset of X. Then m* G -Pm(/) if and only if r + (/ — m*,m* — 
m) > for all m G M. 

If r+(/ — m*, m* — m) > for all m G M, m / m* , then we may have a 7 > for which 

||/ — m|| — ||/ — m*\\ > 7||m — m*|| 

for all m G M. When there exists a 7 > for which 

||/ — m\\ — ||/ — m*\\ > 7||m — m*\\ 

for all m G M, then we say that m* is a strongly unique best approximant to / from M. The reason 
for this terminology is simply that strong uniqueness is stronger than uniqueness. That is, if m* is 
a strongly unique best approximant to / from M, then it is certainly a unique best approximant. 
While the converse does not hold in general, it does and can hold in various settings. This concept 
was introduced in Newman, Shapiro [1963] with respect to certain specific spaces. We will state 
and prove their results in later sections. 

This next result characterizes the optimal (largest) 7 for which such an inequality holds. We 
prove this result under the assumption that M is a linear subspace. The case where M is only a 
convex subset follows analogously, and we will subsequently formally state it as a corollary. 

Theorem 1.4. Let M be a subspace of X and f G X\M. Assume m* G Pm(/)- Set 

7(/) := inf{r + (/ — m*,m) : m G M, \\m\\ = 1}. 
Then j(f) > and, for all m G M, we have 

11/ - m \\ ~ 11/ - > 7(/) \\ m ~ m *\\- 

Furthermore, if 7 > j( f) then there exists an m G M for which 

||/ — rh\\ — ||/ — m*\\ < 7 || m — m*\\. 

Proof: As r + (/ — m*,m* — m) > for all m G M (M is a subspace), we have 7(/) > from 
Theorem 1.2. Assume j(f) > 0. From the fact that r + (f,cg) = cr + (f,g) for all c > 0, it follows 
that 

r_|_(/ — m*,m* — m) > 7(/)||m — m*|| 



for all m G M. From the proof of Proposition 1.1, 

||/ - m* + t{m* - m)\\ - \\f - m* 



> r_|_ (/ — m* , m* — m) 



t 

for all £ > 0. Setting t = 1, we obtain 

||/ — m\\ — ||/ — m*|| > r_|_(/ — m* ,m* — m) > 7(/)||m- — m* 

for all m G M. 
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Now assume 7 > 7(/). By definition, there exists an m G M, \\m\\ = 1, for which 

T+(f ~ m*,m) < j\\m\\. 
Thus for t > sufficiently small, we have 

11/ — m* + tm\\ — 11/ — m*\\ 
~ t < l\\Mh 

implying 

||/ — m* + tm\\ — ||/ — m*\\ < 7||im||. 
Setting m := m* — tm, we obtain 

||/ — fh\\ — ||/ — m*\\ < 7||m — m*\\. □ 
For a convex set M, this same reasoning gives: 
Theorem 1.5. Let M be a convex subset of X and f G X\M. Assume m* G Pm{/)- Set 

1(f) ■= inf ( ^(/ -^"' -"P : £ M, m 7^ m* 
[ ||m* — m|| 

Then 7(/) > and for ad m G M, we have 

||/ — m|| — ||/ — m*|| > 7(/) \\m — m*\\. 
Furthermore, if 7 > 7(/) then there exists an fh G M for which 

||/ — m|| — ||/ — m*|| < 7 ||m — m*||. 

In much of this paper, we consider approximation from linear subspaces and not from convex 
subsets. However, most of the results obtained can be easily generalized to convex subsets, as 
above. We leave their exact statements to the interested reader. 

Note that if M is a finite-dimensional subspace (or closed convex subset thereof) then a com- 
pactness argument implies that the inf in the definition of j(f) in both Theorems 1.4 and 1.5 is in 
fact a min. In the former case, we rewrite an essential consequence of Theorem 1.4 as: 

Proposition 1.6. Let M be a finite-dimensional subspace of X and f G X\M. Assume m* G 
Pm(/)- Then m* is a strongly unique best approximant to f from M, i.e., 7(/), as defined in 
Theorem 1.4, is strictly positive, if and only if there does not exist an m G M, for which 

r+(/-m*,m) = 0. 

For many normed linear spaces, we have that 

T(/, t ) = ltm" + »l'-"l' 

t^O t 

exists for all /, g G X, / 7^ 0. (Note that this is the two-sided limit that need not in general exist.) 
If this is the case then, for any linear subspace M, we have that m* G Pm(I) if and only if 

r(/ — 772*, 777 ) = 

for all 777 G M. Thus, we never have classical strong uniqueness in such spaces. If the functional 
r(f,g) exists for all f,g G X, / 7^ 0, then the space X is said to be smooth. Strong uniqueness 
cannot hold in smooth spaces as strong uniqueness implies that the unit ball has corners. The 
space X being smooth is equivalent to the existence, for each / G X, / 7= 0, of a unique continuous 
linear functional h G X* of norm 1 satisfying h(f) = ||/||. This is true, for example, in any Hilbert 
space and in L p for every p G (1, 00). In all these cases, we must look for other non-classical type 
strong uniqueness formulae. 

The approach that was taken in this section to characterizing best approximants and classical 
strong uniqueness may be found in Papini [1978] and Wulbert [1971]. 
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2. Classical Strong Uniqueness in the Uniform Norm 

In this section, we assume that B is a compact Hausdorff space and consider C(B), the normed 
linear space of continuous real- valued functions defined on B, with norm 

= max|/(x)|. 



For each / G C(B), set A f = {x : |/(x)| = ||/||} and 

sgn/(x) = 



1, f(x) > 0, 
0, f(x) = 0, 
l-l, /(x)<0. 



The formula for the one-sided Gateaux derivative T + (f,g) in C(B) is the following. 
Theorem 2.1. For f,g€ C{B), f / 0, we have 

T+(f,g) = max [sgn f(x)] g(x). 

x€A f 

Proof: We first prove that the right-hand-side is a lower bound for r + (f,g). To this end, let 
x G A f . Thus |/(x)| = ll/H and \f(x)+tg(x)\ < \\f + tg\\. Therefore 

ff . y \\f + tg\\ 

r+(f,g) = hm 



t^o+ t 
>lim \m + tg(x)\-\f(x)\ 

t^o+ t 
_ [sgn f(x)} (f(x) + tg{x)) - [sgn f(x)] fjx) 

t^-o+ t 
- lim t [ s S n /( :r )]f(a ; ) 



This implies that 



t^o+ t 
[sgnf(x)]g(x). 



r+{f,g) > max [sgn f(x)] g(x). 

x€A f 



We prove the converse direction as follows. For each t > 0, let x t satisfy 

\\f + tg\\ = \f(x t )+tg(x t )\. 

As B is compact, there exists an x* 6 B that is a limit point of the x t as t — > 0. Let t n denote a 
sequence, decreasing to zero, along which x tn ■= x n converges to x* . We first claim that we must 
have x* 6 Af. If not, then 



||/ + t n g\\ = \f(x n ) + t n g(x n )\ -> |/(x*)| < 

contradicting the continuity of the norm. As x* G ^4/, it therefore follows that for n sufficiently 
large, 

sgn(/(x n ) + t n g{x n )) = sgn/(x*). 



Strong Uniqueness 

Thus 

\\f + t n g\\ - ll/H [sgnf(x*)}[f(x n ) + t n g(x n )} - [sgn f(x* )]/(**) 



Note that 



[sgn /(£*)] 



[sgn/(x*)] 



fM - f{x*) 

f(x n )-f(x*) 



+ [sgnf(x*)]g(x n ). 



< 



since t n > and [sgn f(x*)]f(x n ) < \\f\\ = \ f{x*)\ = [sgn f(x*)]f(x*). Take limits on both sides 
of the above equality. The left-hand-side has the finite limit r + (f,g), the first term on the right- 
hand-side is nonpositive while 

lim [sgnf(x*)]g(x n ) = [sgn f(x*)] g(x*) < max [sgn f(x)} g(x). 

n— >oo x ^Af 



Thus 



r+(f,g) < max [sgnf(x)]g(x). □ 

xeA f 



As a consequence of Theorem 2.1, Theorem 1.2 and Theorem 1.4, we have: 
Theorem 2.2. Let M be a linear subspace of C(B). Then m* G Piwif) if and oniy if 

T + (f — m*,m)= max [sgn(/ — ra*)(x)] m(x) > 

for aJI m G M. Furthermore, m* is a strongly unique best approximant to f from M if and only if 

j(f) = inf max [sgn(/ — m*)(x)] m(x) > 0. 

||m|| = l J 

This characterization of a best approximant is called the Kolmogorov criterion. This charac- 
terization of the optimal strong uniqueness constant 7(f) first appeared in Bartelt, McLaughlin 
[1973]. If M is finite-dimensional, then the above inf can be replaced by a min. This is not true in 
infinite dimensions. An example illustrating that fact may be found in Bartelt, McLaughlin [1973]; 
see Example 4. 

In the finite-dimensional setting, the above characterization results can be further refined. Wc 
firstly state the following well-known result. 

Theorem 2.3. Let M be an n-dimensional subspace of C(B). Given f G C{B), we have m* G 
Pm(I) if and only if there exist k distinct points x±, . . . , x^ in Af_ m * , and strictly positive values 
Ai, . . . , Afc, 1 < k < n + 1, such that 



^2 Aj[sgn(/ - m*)(x i )}m(xi) = 



for all m G M. 



(A variation on a proof of Theorem 2.3 is given by the proof of Theorem 10.3.) 
We also have the following result that is proved in Brosowski [1983], and was also later proved 
in Smarzcwski [1990], generalizing results from Bartelt [1974] and Niirnberger [1980]. 
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Theorem 2.4. Let M be an n-dimensional subspace of C(B). Given f G C(B)\M, we have that 
m* G M is the strongly unique best approximant to f from M if and only if there exist k distinct 
points xi, . . . , Xk in Af_ m * , and strictly positive values Ai, . . . , A&, with n + 1 < k < 2n, such that 

k 

Y,K[sgn(f-m*)(x l )}m(x l )=0 (2.1) 

i=l 

for all m G M, and 

dimM| {:cl! ... )Xfc} =n. 
Proof: We first prove the "easy" direction in this theorem. 

(<^=). Assume there exist points {xi}f =1 satisfying the conditions of the theorem. As the x\, . . . , Xk 
are in Af_ m *, we have 

7(/) > 7 = rnin max sgn(/ — m*)(xi)m(xi). 

mGM i=l,...,k 
||m|| = l 

We claim that 7 > 0. If this holds then "f(f) > and m* is the strongly unique best approximant 
to / from M. To see that 7 > 0, recall that since M is a finite-dimensional subspace then from 
a compactness argument the above minimum is always attained. Thus if 7 < 0, there exists an 
m G M, ||m|| = 1, for which 

sgn(/ — m*){xi)fh(xi) < 

for i = 1, . . . , k. Moreover as fh £ M, fh 7^ 0, then from the fact that dimMl.^^ ... jXfc } = n, it 
follows that fh(xi) / at at least one of the Xi. As A« > 0, % = 1, . . . , k, this then implies 

k 

Ai sgn(/ - m*)(xi)m(xi) < 

i=i 

which contradicts (2.1). Thus 7 > 0. 

(=^). Assume m* G M is a strongly unique best approximant to / from M. Without loss of 
generality, we will assume m* = 0. 

We start with the case where dimM = 1. Let M = spanjmi}. Since 

7(/) = max[sgn/(x)](±mi(x)) > 0, 

x€A f 

there must exist points xi,x 2 G Af for which we have that both [sgn/(xi)]mi(xi) > and 
[sgn f(x 2 )]mi(x2) < 0. Thus, there exist Ai,A2 > satisfying 

Ai[sgn/(xi)]m(a:i) + A 2 [sgn f(x 2 )]m(x 2 ) = 

for all m G M, and dimM|{ a , 1 jX2 j = 1. This is the desired result. 
We now consider the general case where dimM = n. Since 

max[sgn/(x)]m(x) > c > 

xeA f 

for all m G M, ||m|| = 1, it follows that G M n is in the strict interior of the convex hull of 
E = {([sgnf(x)]m 1 (x),...,[sgnf(x)]m n (x)) : x£A f }cR n 
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where {mi, . . . ,m n } is any basis for M. From a generalization of Caratheodory's Theorem, essen- 
tially due to Steinitz, see Danzer, Griinbaum, Klee [1963], the vector 6 1" is in the strict interior 
of the convex hull of some set of at most 2n points of E. That is, there exist x\, . . . ,Xk £ Af, 
k < 2n, such that G IR n is in the strict interior of the convex hull of 

E* = {([sgn/(xi)]mi(xi), . . . , [sgn f(xi)]m n (xi)) : i = l,...,k}. 

From the fact that 6 M. n is in the strict interior of the convex hull of E*, it easily follows 
that there exist Ai, . . . , A* > 0, Yli=i \ = 1) f° r which 

fc 

= ^2 Xi[sgn.f(xi)]mj(xi), j = 1, . . . ,n, 

i=l 

implying 

fc 

= ^2 K[sgnf(xi)}m(xi), 

i=l 

for all m € M, and also that the vectors 

([sgn/(xj)]mi(a;j),.. . , [sgn f(xi)]m n (xi)), i = l,...,k, 
span M. n . Since sgn/(xj) / for each i, this is equivalent to the fact that the vectors 

(mi(s j ),...,m n (j; j )), i = l,...,/c, 

span M n , i.e., 

dimM l{^i,..,x fc } 

From this fact and 

fc 

= ^ K[sgnf(xi)]m(xi), 
i=l 

for all m G M, it follows that /c > n + 1. □ 

Is the bound on k, namely n + 1 < k < 2n, the correct bound? The answer is yes. The value 
k = n + 1 is minimal. If k < n there is always a nontrivial m £ M that vanishes at x\, . . . , Xk-i 
and then min{[sgn(/ — m*)(xfc)](±m(xfc))} < implying 7(/) = 0. In the next section, we see 
that in the Haar space setting, we can and do have k = n + 1. The case k = 2n is necessary if, for 
example, M is spanned by n functions with disjoint support. To see this, assume mi, . . . , m n is a 
basis for M where these basis functions have disjoint support. Assume / G C{B)\M and consider 
(2.1). In order that dimM|{ Xl! ... ;Xk } = n > it is necessary that among the xi, . . . , Xk there is at least 
one point in the support of each rrij. But we must have at least two points in the support of each 
nij among the xi,...,Xk if (2.1) is to hold. Thus k = 2n is necessary for strong uniqueness in this 
case and, by the above, there do exist / G C(B)\M with strongly unique best approximants from 
M. 

It might be conjectured that uniqueness and strong uniqueness are equivalent properties in 
C{B). This, however, is not true, as can be seen from this example taken from Cheney [1966, 
p. 82]. 
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Example. Let M = spanjx} in C[— 1, 1], and set f(x) = 1 — x 2 . As is easily verified, the unique 
best approximant to / from M is m*(x) = 0. On the other hand, m* is not a strongly unique best 
approximant to / from M since Af = {0}, and m(0) = 0, i.e., t_|_(1 — x 2 ,x) = 0. 

Nevertheless, while uniqueness and strong uniqueness are not equivalent properties in C(B), 
the set of functions with a strongly unique best approximant is dense in the set of functions with a 
unique best approximant when approximating from a finite-dimensional subspace. This next result 
and the subsequent Corollary 2.7 are from Niirnberger, Singer [1982]. The proof given here is from 
Smarzewski [1988]. 

Theorem 2.5. Let M be a finite-dimensional subspace ofC(B). Then the set of functions with a 
strongly unique best approximant is dense in the set of functions with a unique best approximant. 

Note that we have an algebraic characterization for a best approximant and an algebraic char- 
acterization for a strongly unique best approximant from any finite-dimensional subspace (Theo- 
rems 2.2, 2.3 and 2.4). However there is no known algebraic characterization for when we have a 
unique best approximant. We get around this by proving the next proposition. We start with some 
notation. 

Let us assume, for ease of presentation, that the zero function is a best approximant to / from 
M. Thus from Theorem 2.2, we have 

max[sgn/(x)]m(x) > 

x€A f 

for all m G M. Let (k n ) be any strictly decreasing sequence of positive numbers that converges to 
zero, and assume k n < 1/2 for all n. Set 

B n :={s:|/(aO| > (1 - k n )\\f\\}, n = 2,3,... . 

Then we have: 

Proposition 2.6. Let M be a hnite-dimensional subspace of C(B). If the zero function is the 
unique best approximant to f from M then for each n, there exists an a n > such that 

max [sgn f{x)]m{x) > a n ||m|| 

xeB„ 

for all m G M, where B n is as above. 

Remark. The above is a necessary, but not a sufficient condition, for the uniqueness of the best 
approximant. As an example, consider f(x) = 1 — \x\ on [—1,1], and M = spanjmi} where 
mi(x) = x on [—1, 1]. Then, as is readily verified, B n = [— k n , k n ] and 

max [sgn f(x)]m(x) = A; n ||m|| 
xeB„ 

for all m 6 M. Moreover, while the zero function is a best approximant to / from M, it is not the 
unique best approximant as we also have ±mi G Pm{I)- 

Proof: Assume to the contrary that the desired inequality does not hold for some n > 2. This 
inequality trivially holds for m = 0. As such, we can consider it over the boundary of the unit ball 
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of M, and since M is a finite-dimensional subspace, we can use compactness to affirm that there 
exists an rh G M, \\m\\ = 1, such that 

max [sgn /(x)]m(x) < 0. 

x€B n 

Choose any a > satisfying a < fc n ||/||. Thus 

\arh(x)\ < \f{x)\ 

for all x G -B„, and therefore, we also have thereon 

|/(x) + am(x)| < |/(x)| + a[sgn/(x)]m(x) < |/(x)| < 

On the other hand, for x G B\B n , we have 

\f(x) + am(x)\ < \f(x)\ + a\m(x)\ < (1 - fc n )||/|| + M/ll 

Thus -am 6 -Pm(/) 5 contradicting the fact that the zero function is the unique best approximant 
to / from M. □ 

One immediate consequence of the above proposition is the following. 

Corollary 2.7. If M is a subspace of then every unique best approximant to / G f ™ from M 
is also a strongly unique best approximant to f from M. 

Proof: For n sufficiently large, we always have, in this case, B n = Af. Apply Proposition 2.6 and 
the characterization of strong uniqueness found in Theorem 2.2. □ 

Proof of Theorem 2.5: If / G M, there is nothing to prove. As such, we assume that 
/ G C(B)\M. Without loss of generality, we assume that the zero function is the unique best 
approximant to / from M. Let B n and a n be as in Proposition 2.6. 

By the Tietze-Urysohn Theorem, see e.g., Kuratowski [1966], there exists a f n G C(B) such 

that 

f n ( x ) 11/11 s gpf( x )> xeB n+2 

and 

(l-k n )\\f\\<f n (x)< 

for x G B n . If B n = B n+2 , then we simply set f n (x) = \\f\\ sgn/(x) thereon. We extend /„ to all 
of C(B) by setting 

fn(x) = fix) 

for x G B\B n . Note that we do not lose continuity in the case where B n = B n+2 since in that case 

{x:(l-fc B+ 2)||/||>|/(x)|>(l-fc B )||/||} = 0. 
From this construction, we have 

-/„(*)! <M/|| 

for all x G B. That is, we have 

lim ||/ -/J =0. 

n— >oo 

Now, since Af n 5 B n+2 5 Af, it follows from Theorem 2.2 that G PM(fn) for all n. 
Furthermore from Proposition 2.6, we have 

min max [sgn f n (x)]m(x) > min max [sgn f(x)]m(x) > a n+ 2 > 0. 

II m 11=1 tn l|m||=l " + 2 

Thus, again applying Theorem 2.2, we see that the zero function is the strongly unique best 
approximant to f n from M. □ 



f{x), x G -Bn n (B\B n+2 ) 
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The above begs the question of when the set of functions with a unique best approximant 
from a given finite-dimensional subspace M is dense in C{B). A similar question was considered 
by Garkavi [1964], [1965] who proved that the n-dimensional subspace M of C(B) has the property 
that every function in C(B) has a unique best approximant from M except for a set of first 
category in C(B) if and only if on each open subset D of B there identically vanish at most 
(n — = max{n — |D|,0} linearly independent functions of M, where \D\ is the number of 

points in D. Thus, for example, the set of functions with a strongly unique best approximant from 
M is dense in C[a, b] if no m G M, m / 0, vanishes identically on an open interval. 

Bounds for j(f ) are difficult to obtain. The following result, from Grothmann [1989], gives an 
upper bound for j(f). In general, using it to find explicit bounds is very difficult. 

Before stating the result, we recall the definition of a projection constant of a subspace L in a 
normed linear space X. It is given by 

X(L;X) := inf{||P|| : P : X -> L is a projection}. 

Proposition 2.8. Let M be a linear subspace of C(B). Let f G C{B) and assume m* G M is a 
strongly unique best approximant to f. Then 

\{M\ Af _ m ,;C{A f - m .)) 
7Uj " A(M;C(P)) 

Proof: Since m* is a strongly unique best approximant to / from M, we have that 

r y(f)= min max [sgn(/ — m*)(x)]m(x) > 0. 

l|m||=l ' 

This also implies that 

|| m IU / _ m « = max \m(x)\ 



xeA 



f — m* 



is a norm on M since no m G M, can vanish identically on Af_ m * . From the above formula 

for 7(/), we have 

7(f) < min max |m(x)| 

\\m\\ = l 1 m 



II II f — m* 

mm — 



mgM ||m|| 

llml 



max — — 

meM \\m j, 

II II ™-f — m* 



Let P : C(Af_ m «) ->■ M| j4/ _ . be a projection. Define a : M\ Af _ , ->■ M and : C(P) 

(T(m| A/ _ m ») =m 



for m G M and 



0(/) = /U^, 



for / G C(B). The linear operator <r is well-defined since no m G M, can vanish identically 

on Af_ m *. Set 

Q(/) := <r(P(<K/))). 
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Q is a projection from C(B) onto M. Thus 

A(M;C(B))<||g||<H||P||W. 
Now H0II = 1 and, by the above, ||cr|| < l/j(f). Thus 

A(M;C(B))<-L||P|| 

for all projections P : C(Af_ m *) — >■ M|,i m „ . Therefore 

7U; - A(M;C(P)) 

In Section 4, where we assume M is a Haar space, we discuss other somewhat more computable 
bounds for 7(/). 

3. Local Lipschitz Continuity and Classical Strong Uniqueness 

We start once again with an arbitrary normed linear space X. Assume M is a subset of X, and to 
/ € X there exists a unique best approximant from M. If, for a given / G X, we have an inequality 
of the form 

\\P M {f) - P M (g)\\ <<y\\f-g\\ 

valid for all g G X and any element of Pm{9), then we say that the best approximation operator 
from M is locally Lipschitz continuous at /, and call a a Jocai Lipschitz constant. 

One of the "uses" of classical strong uniqueness is that it implies local Lipschitz continuity. 
Before proving this result, we recall that we always have, for every f,g G X, 

\\\f-P M (f)\\-\\ g -P M (g)\\\<\\f- g \\. 

To verify this, note that, assuming ||/ — Pm(/)|| > \\g — Pm (flOH, then 

11/ - Pm(/)|| < 11/ - P M (g)\\ < 11/ - g\\ + \\g - P M (g)\\, 
from which the result follows. 

Theorem 3.1. Assume M is a subset of a normed linear space X, f G X, and for some 7 > 0, we 
have 

||/-m||-||/-P M (/)||> 7 || m -P M (/)|| 
for all m G M. Then for each g G X and any element of Pm{9) 

||Pm(/)-Pm(<7)II<-||/-<7||- 

7 

Proof: By assumption, and an application of the previous inequality, 

l\\PM(f) - Pm(9)\\ < 11/ - Pm(9)\\ ~ 11/ - PmU)\\ 
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< ||/ - g\\ + \\g - P M (g)\\ - \\f - P M (f)\\ < 2||/ - g\\. 

Thus 

\\P M (f)-P M (9)\\<l\\f-g\\. □ 

In the specific case where M is a finite-dimensional Haar space, see Section 4, local Lipschitz 
continuity of the best approximation operator was proved in Freud [1958] by a different method of 
proof. For polynomial approximation on an interval, this result can be found in Kirchberger [1902, 
p. 18-21] (see also Borel [1905, p. 89-92]). 

We have proven that strong uniqueness at / implies local Lipschitz continuity at /. The 
converse direction, i.e., local Lipschitz continuity implying strong uniqueness, does not necessarily 
hold. It certainly does not hold in an inner product space where we always have 

\\P M (f) - Pm(9)\\ < \\f-g\\ 

for all /, g if M is a subspace. That is, we have local Lipschitz continuity and do not have strong 
uniqueness, see the discussion at the end of Section 1. However in C(B) the converse does hold, 
i.e., local Lipschitz continuity at / € C(B) does imply strong uniqueness at this same /, assuming 
M is a finite-dimensional subspace. This next theorem is to be found in Bartelt, Schmidt [1984]. 

Theorem 3.2. Let B be a compact Hausdorff space and M a finite-dimensional subspace ofC(B). 
For given f G C(B), the following are equivalent: 
(I) There exists a 7 > such that 

||/- m ||-||/-P M (/)||> 7 || m -P M (/)|| 

for all m G M. 
(II) There exists a a > such that 

\\P M (f)-PM(9)\\<<T\\f-g\\ 

for all g G C{B). 

Proof: From Theorem 3.1, we have that (I) implies (II). It remains to prove the converse direction. 
We prove the result by contradiction. That is, we assume that (I) does not hold for a given / and 
will prove that (II) does not hold for this same /. Note that if / G M then (I) and (II) hold, while 
if the best approximant to / from M is not unique, then neither (I) nor (II) hold. As such, we 
assume / ^ M and the best approximant to / from M is unique. 

Without loss of generality, we can and will assume that the zero function is the unique best 
approximant to /, and ||/|| = 1. Now, since (I) does not hold, there exists, for each e > 0, an 
m £ G M satisfying 

||/-^e|| < 11/11 +e||™ £ || = l + e||m e ||. 

Note that this implies that m £ / 0. We divide most of the proof into a series of four lemmas. 

Lemma 3.3. There exist sequences (S n ) and (a n ) of positive numbers tending to zero and an 
m G M, \\rh\\ = 1, such that 
(i) \\f-a n m\\ < 1 + 
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(ii) rh(x) sgn/(x) > for all x € ^4/. 

Proof: Let m £ be as above. We first claim that for e < 1/2, we necessarily have ||m £ || < 4. To see 
this, assume ||m e || = C. Then for < e < 1/2, we have 

^ > e||m e || > ||/ - m £ \\ - 1 > ||m e || - ||/|| - 1 = C - 2, 

whence C < 4. 

As ||m e || < 4, we have 

lim ellmJI = 0. 

Thus 

lim ||/ - m £ \\ = ll/H = min ||/ - m\\. 

£->0+ mgM 

From a compactness argument, it therefore follows that 
i.e., m £ tends to the zero function. Set 



lim \\m £ \\ = 0, 



m £ := m £ \\m £ \\ 

and let a £ = ||m e ||. Thus ||m e || = 1 for all e, and a £ tends to as e tends to 0. As the fh £ are all 
functions of norm 1 in a finite-dimensional subspace, on a subsequence e n — > + , we have 

lim m £n = rh 

n— >oo 

where rh £ M and ||m|| = 1. For convenience, let m n := rh £n , m n := m £n and a n := a £n . 

We claim that m(x)sgn/(x) > for all x G Af. Assume not. There then exists an i* e 
for which 

m(x*) sgn f(x*) = — c < 0. 

Thus for n sufficiently large, 

fh n {x*) sgn f(x*) < --, 

and therefore 

l + e n a n = \\f\\+£n\\m n \\> \\f -m n \\ 

> \f(x*) -m n (x*)\ = \f{x*)\ +a n \m n (x*)\ > 1 + a n ^. 

But this cannot possibly hold for n sufficiently large as e n — > 0. Thus m(x) sgn f(x) > for all 
x £ A f . 

Let /3 n = \\fh n — fh\\. Therefore f3 n tends to zero as n tends to infinity. Now 

[||/ - a n fh\\ - 11/11] - [||/ - a n m n \\ - ||/||] = ||/ - a n fh\\ - \\f - a n m n \\ 

— 1 1 CK^TTT. Oi n ?Tl n || Ct n f3 n . 

Thus 

< ||/ - a n m\\ - ll/H < ||/ - a n m n \\ - \\f\\ + a n (3 n < a n e n + a n /3 n = a n (e n + j3 n ). 
Set S n := e n + (3 n . Then 



lim 5 n = 



and this proves the lemma. □ 



A. Kroo and A. Pinkus 



18 



Recall that we assumed that G Pm(J)- This implies, by Theorem 2.3, the existence of k 
distinct points xi, . . . , x& G ^4/, 1 < A; < n + 1, and strictly positive values Ai, . . . , A& such that 

fc 

^ A i [sgn/(x i )]m(x i ) = 

for all m G M. 

Lemma 3.4. m(xj) = 0, for i = 1, . . . , fc. 
Proof: From Lemma 3.3 (ii), we have 

< m(xi)sgnf(xi) 

for alH = 1, . . . , k. Thus if we have strict inequality for any i, then we contradict the fact that 

k 

^2 Xi[sgn.f(xi)]fh(xi) = 0. □ 
i=i 

For each given n, let 

G n = {x: \m(x)\ < S n /2}. 

Note that G n is an open neighborhood of the {x\, . . . ,Xk}- We define a function 4> n G C(B) as 
follows. Firstly set 4> n {xi) = a n 5 n , i = 1, ... , fc, and 4> n (x) = for all x G B\G n . Note that on 
{xi, . . . , Xfc} U B\G n , we have 

< <^n(x) < |a n (5 n - a n |m(ar)| |. 

This follows from the fact that fh(xi) = 0, i = 1, . . . , k. Extend 4>n continuously to all of B so that 
it continues to satisfy 

< <p n (x) < \a n 5 n - a n \m(x)\ \ 
for all x G B. Now set g n (x) := f(x) [1 + <p n {x)}. 
Lemma 3.5. We have 

11/ - 9n\\ = Ot n 8 n . 

Proof: From the definition of g n , f(x) — g n (x) = —f(x)4> n (x). Since ||/|| = 1, we therefore have 

ll/-0n|| < Hnl 

On G n , where (fr n need not vanish, we have 

\m(x)\ < y. 

Thus for x G G n 

< (pn(x) < \a n 5 n - a n \m{x)\ \ = a n 5 n - a n \m(x)\ < a n 5 n . 
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Therefore ||/ — g n \\ < a n 5 n . Equality holds since 

\f(Xi) ~ g n {Xi)\ = \f(Xi)4> n (Xi)\ = \4> n {Xi)\ = a n 5 n . □ 

Lemma 3.6. We have 

min \\g n — m 

meM 

Proof: For each i = 1, . . . , k, since fh(xi) 

\g n {xi) - a n m(xi)\ = \g n (xi)\ = 1/(^)1 I 1 + <t>n{xi)\ = 1 + 4>n{xi) = 1 + a n 5 n . 

Now for x G B\G n , since <fi n (x) = 0, we have 

\g n (x) - a n m(x)\ = \ f(x) - a n rh(x)\ < \\f - a n m\\ < 1 + a n 5 n . 

The latter inequality is from Lemma 3.3 (i). For x G G„, since < 1 and |m(x)| < 5 n /2, we 

have 

\g n (x) - a n m(x)\ = \f(x) + f(x)(f> n (x) - a n rh(x)\ 

< 1 + 1 

< 1 + a n 5 n - a n \m(x)\ + a n \m(x)\ = l + a n 5 n . 

This proves that 

\\g n - a n m\\ = 1 + a n 5 n . 

As 

g„(xi) - a n m(xi) = g n (x{) = f{xi) [1 + a n 5 n ] 

for i = 1, . . . , k, it follows from Theorem 2.3 characterizing best approximants that a n fh is a best 
approximant to g n from M. □ 

Proof of Theorem 3.2 (cont'd): If (II) holds, then there exists a a > such that 

\\Puf -P M g\\<o-\\f -g\\ 

for all g G C(B). Recall that we assumed, without loss of generality, that the zero function is the 
best approximant to / from M and ||/|| = 1. Taking g = g n and applying the above lemmas, we 
obtain 

a n = \\a n m\\ < a\\f - g n \\ = aa n 5 n . 

Thus 1 < a5 n for all n. But lim n ^oo 5 n = 0. This is a contradiction. □ 

Based on Theorems 2.4 and 3.2, we now have a characterization for local Lipschitz continuity of 
the best approximation operator in the uniform norm from finite-dimensional subspaces. A weaker 
sufficient condition can be found in Kovtunec [1984]. 



a n m\\ = 1 + a n 5„ 



= 0, we have 
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4. Strong Uniqueness in Haar Spaces in the Uniform Norm 

We start with the definition of a Haar space. 

Definition 4.1. An n-dimensional subspace M of C{B) is said to be a Haar space if no nontrivial 
m 6 M vanishes at more than n — 1 distinct points of B. 

A unicity space is any subspace M of a normed linear space X with the property that each 
/ G X has a unique best approximant to / from M . The following result was proved by Haar 
[1918] and is built upon earlier results of Young [1908]. 

Theorem 4.1. An n-dimensional subspace M of C(B) is a unicity space if and only if it is a Haar 
space. 

As a result of this theorem, the term Haar space is often applied to any unicity space in any 
normed linear space. But here we use the term Haar space as that given in the above definition. 
There are many equivalent definitions of the Haar space property. Here are two that will prove 
useful. 

1) An n-dimensional subspace M of C(B) is a Haar space if and only if dimM|{ Xlj _ ^ Xn j = n for 
every choice of n distinct points xi, . . . , x n in B. 

2) Let mi, ■ ■ ■ , fn n be a basis for M. Then M is a Haar space if and only if 

det( mi (x 3 ))l 1=1 ? 

for every choice of n distinct points x\, . . . , x n in B. 

For n>2, there are rather restrictive conditions on B needed to ensure that C(B) can contain 
a Haar space of dimension n. Essentially, B must be homeomorphic to a subset of S . Exact 
conditions are the content of Mairhuber's Theorem; see Mairhuber [1956], Sieklucki [1958], Curtis 
[1959], Schoenberg, Yang [1961], and McCullough, Wulbert [1985]. 

When we have a Haar space, the characterization result Theorem 2.3 can be further strength- 
ened. 

Theorem 4.2. Let M be an n-dimensional Haar space on C(B). Given f G C{B), we have that 
m* is the best approximant to f from M if and only if there exist n + 1 distinct points x\, . . . , x n+ \ 
in Af_ m * and strictly positive values Ai, . . . , A„+i such that 

71+ 1 

Aj[sgn(/ - m*)(xi)]m(xi) = 

i=l 

for all m £ M. 

Proof: This does not differ much from Theorem 2.3. The difference is in the claim that the k 
therein must be n + 1. To see this, let 

Gi = Aj[sgn(/ - m*)(xi)], i = l,...,k. 

From Theorem 2.3, we have 

k 

(Tim{xi) = 

i=l 
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for all m G M. Let mi, ... , m n be any basis for M. Then 

fc 

^^772^X^=0, j = l,...,n. 

1=1 

li k < n, then this implies that 

rank(mj(xj))jLi" = i < 
since (<7i, . . . , <jfc) / 0. But this contradicts the fact that 

rank^-^i))"^! = n 

for all distinct yi, . . . , y n in B. Thus = n + 1. □ 

A Haar space on an interval is generally called a Chebyshev space (or Tchebycheff space) and 
often abbreviated a T-space. A basis for a T-space is sometimes called a T-system. For a T-space, 
because of the connectedness of the interval, we have: 

Proposition 4.3. Let mi, . . . , m n be any basis for M. Then M is a T-space if and only if 

e det(mi(xj))£j=i > 

for some fixed e £ {—1, 1} and all x\ < ■ ■ ■ < x n . 

For T-spaces, we can further specialize Theorem 4.2 into a final and more geometric form. We 

have 

Theorem 4.4. Let M be an n-dimensional T-space on C[a,b]. Given f £ C[a,b], we have that 
m* is the best approximant to f from M if and only if there exist points a < x\ < ■ ■ ■ < x n+ i < b 
and a S G {— 1, 1} such that 

(-iyS(f-m*)(x i ) = \\f-m*\\, i = l,...,n + l. 

Proof: (<=)■ Assume m* exists satisfying the above. If m* is not a best approximant to / from 
M, then there exists an in £ M for which 

||/-m|| < ||/-m*||. 

Then for each i G {1, . . . , n + 1}, we have 

(-iyS(f-m*)( Xi ) = \\f-m*\\ > ||/ -m|| 

> \{f-m)( Xl )\ > (-iyS(f-ih)( Xl ). 

Thus 

(-iyS(m-m*)(xi) > 0, i = l,...,n + l. 

It therefore follows that in — m* has at least one zero in each of the intervals (xj, Xj + i), i = 1,... ,n. 
That is, rh — m* has at least n distinct zeros on [a, b\. But ih—m* £ M where M is an n-dimensional 
T-space. Thus in = m*, which contradicts our hypothesis. 
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(=>). Assume m* is the best approximant to / from M. Then from Theorem 4.2, we have n + 1 
distinct points a < x\ < ■ ■ ■ < x n+ i < b in Af_ m *, and strictly positive values Ai, . . . , A n+ i such 
that 

n+l 

^2 Ai[sgn(/ - m*)(xi)]m(xi) = 

i=l 

all m £ M. 

We wish to prove that 

[sgn(/ - m*)(xi)][sgn(/ - m*)(x i+ i)] < 0, i = l,...,n. 

To this end, set 

a; = Aj[sgn(/ - m*)(xi)], i = 1, . . . ,n + 1. 
Thus, we wish to prove that 

(7j(jj + i < 0, i = 1, . . . ,n. 
Let mi, ... , m n be any basis for M. From the above, it follows that 

n+l 

1 Y^a i m j (xi) = 0, j = l,...,n. 

i=i 

Since 

rank^^))™^ 1 ™^ = n, 

we have 

a r = a(-l) r det^ ^))^ 1 ^! 

for all r = 1, . . . , n + 1, where a / 0. Thus from Proposition 4.3, 

oyo+i < 0, r = 1, . . . , n. □ 

The following was first proven in Newman, Shapiro [1963] where they introduced the concept 
of (classical) strong uniqueness. 

Theorem 4.5. Let M be a finite-dimensional Haar subspace of C(B). Then the unique best 
approximant to f from M is a strongly unique best approximant to f from M. 

Proof: The proof follows immediately from Theorems 2.4 and 4.2, and the definition of a Haar 
space that implies that we always have dimMj^^ ... tXn+1 y = n. □ 

Thus, when M is a finite-dimensional Haar space, the functional 7(/) is strictly positive on 
C(B). It also has certain simple properties. For example, it is easy to verify that j(af — m) = j( f) 
for all a G M, a ^ 0, and m G M. Other properties are less obvious and for good reason. 

For example, the functional j(f) is in general not a continuous function of /. This is because 
Af_ m * is a highly noncontinuous function of /. 

Example. Assume B = [—1,1] and M = span{l,x}. M is a T-space on C[— 1,1]. If Aj_ m * = 
{ — 1,0,1} with sign of the error +1, — 1, +1, respectively, then 7(/) = 1/3. However, if Af_ m * = 
{— 1, [—1/2, 1/2], 1} with signs +1, —1, +1, respectively, i.e., / — m* attains its norm positively at 
±1 and negatively on the full segment [—1/2,1/2], then 7(/) = 3/5. It is not at all difficult to 
construct an / and a sequence /„ of functions in C[— 1, 1], all with the zero function as their best 
approximants from M, and such that Af n = { — 1, 0, 1} for all n, while Af = {— 1, [—1/2, 1/2], 1}. 

While we do not have continuity of j(f), we do have upper semi-continuity when M is a 
finite-dimensional Haar space. This result is contained in Bartelt [1975] who writes that it is due 
to Phelps. 
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Proposition 4.6. Assume M is a finite-dimensional Haar subspace of C{B). Then the optimal 
strong unicity constant 7 is an upper semi-continuous function on C(B). 

Proof: Assume to the contrary that there exists a sequence of functions f n in C(B) that tend to 
/ G C(B) such that 

7(/n)>7(/) + £ 

for some e > and all n. Let m* G Pm{/) and m n G PM(fn) f° r each n. For given m G M, it 
follows from the definition of j(f n ) that 

||/n - m\\ > \\f n -m n \\+ j{f n )\\m -m n \\ > \\f n -m n \\ + ( 7 (/) + e)\\m - m n \\. 

Since M is a Haar space, it is well-known that the best approximation operator is a continuous 
operator. That is, from the fact that /„ converges to /, it follows that m n converges to to*. Thus it 
also follows that as n — > 00, we have that \\f n — m\\ tends to ||/ — m\\, \\f n — m n \\ tends to ||/ — to* || 
and 1 1 to — m n \\ tends to \\m — m*\\. Taking limits in the above inequality, we obtain 

11/ -m|| > ||/- m *|| + ( 7 (/)+e)||TO-TO*||. 

Since this holds for every to G M, this contradicts the definition of 7(/). □ 

As noted previously, the reason for the lack of continuity of 7(/) is in the fact that yl/_ m * is 
highly noncontinuous. When we have continuity of this set, then we have continuity of the j(f). 
To explain what we mean, let us first define a distance between sets. In what follows, we assume 
that B is a compact metric space. For sets C, D C B, we let 

d(C,D) = sup mtp(x,y) 

where p is a metric on B. There are many different such 'distances'. We will make do with this 
one. This result is contained in Bartelt [1975] 

Proposition 4.7. Assume M is a finite-dimensional Haar subspace ofC(B), where B is a compact 
metric space. Let f n be a sequence in C(B) converging uniformly to f G C(B)\M. Let m n G 
P M (fn) and to* G P M (f). If 

lim d(A fn - mn ,A f - m .) = 0, 

n— >oo 

then 

lim 7 (/n) = 7(/)- 

n— >oo 

Proof: From Proposition 4.6, it follows that we need only prove that for any given e > there 
exists an TV such that for all n > N, we have 

7(/n) + e>7(/)- 

We start by simplifying things somewhat. Set g n := f n — m n and g := f — to*. Then 
A 9n = A fn _ mn and A g = A f _ m *, j(g n ) = j(f n ) and 7(5) = 7 (/), and the zero function is a 
best approximant from M to each of the g n and g. Since M is a Haar space, we again have that 
to„ converges to to*. Thus the g n converge to g, and g £ M. We will, without loss of generality, 
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assume ||<? n || = \\g\\ = 1 for all n. We also use the fact that the unit sphere in M is uniformly 
equicontinuous on B. That is, given e > 0, there exists a 5 > such that if p(x, y) < S then 

\m(x) — m(y)\ < e 

for all m 6 M, \\m\\ = 1. With these preliminaries, we can now prove the desired result. 
By assumption, given e > 0, there exists a 5\ > such that if p(x, y) < 5\ then 

\m(x) — m(y)\ < e 

for all m £ M, ||m|| = 1. Similarly from the uniform continuity of g, there exists a e>2 > such 
that if p(x, y) < 62 then 

\g(x)-g(y)\ < 1/2. 
By assumption, there exists an Ni such that for all n > Ni, we have 

||0n-0ll < 1/2. 

And finally, by assumption, given 5 > 0, there exists an A^2 such that for all n > N2, we have 
^(^4^, A g ) < 5, i.e., for each x £ A g , there exists ay £ ^4 9n such that p(x,y) < 5. For our given 
e > 0, we therefore set 5 = min{<5i, 5 2 } and iV = maxjA^i, A^}. 
Given any m G M, ||ra|| = 1, let x* G A 5 satisfy 

maxfsgn <7(x)]m(x) = [sgng(x*)]m(x*). 
By assumption for all n > N, we have 

Thus there exists a y* £ A gn such that p(x*,y*) < 5 implying that 

\m{x*) — m(y*)\ < e. 
In addition, as |</(x*)| = 1, n > N and p(x*,y*) < 6, we have 

sgns-(x*) = sgn g(y*) = sgn g n (y*). 

Thus 

max[sgng(x)]m{x) = [sgn^(x*)]m(x*) < [sgn g n (y*)]m(y*) + e. 
xeA g 

As this is valid for all m £ M, \\m\\ = 1, it implies that for all n > N 

l(f)=l(9) <l(9n)+e = j(f n ) + e. □ 

Remark. In Propositions 4.6 and 4.7, we did not really use the full Haar space property. We used 
the fact that the / and /„ all had strongly unique best approximants from M. 

It would be desirable if we could bound 7(7) from below away from zero for all / 6 C{B) 
and thus dispense with the dependence upon /. This is certainly possible if M is a Haar space of 
dimension 1. In this case, M = spanjm} where m € C(B) has no zero, and from the definition of 
7(/), we obtain 

\ui( x) 

7(f) > min |m(x)| = min — — — > 0, 

||m|| = l 11 11 

a lower bound independent of /. 

Moreover, if B is a finite set, then we can always uniformly bound 7(/) from below away from 
zero if M is a Haar space. This follows simply from a compactness argument. However if B is not 
a discrete set and M is a Haar space of dimension n, n > 2, then 7(7) cannot be bounded away 
from zero. The following is a variation on a result proved in Cline [1973]. 
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Proposition 4.8. Let B be a compact Hausdorff space that is not discrete. Assume M C C(B) 
is a Haar space of dimension n, n > 2. Then 

inf 7 (/) = 0. 

/EC(B) ' KJ ' 
11/11 = 1 

Proof: Let x be any accumulation point of B, i.e., every neighbourhood of x contains at least 
one other point (and thus an infinite number of points) of B. As M is of dimension n, n > 2, 
there exists an m G M satisfying mix) = and ||m|| = 1. Given e > 0, choose n + 1 arbitrary 
distinct points {xj}™^ in 5 where |m(x;)| < £. Such points can all be chosen from a suitable 
neighbourhood of x. Since M is a Haar space of dimension n, there exist ai, . . . , a n+ i, all nonzero, 
such that 

re+l 

^a;m(x;) = (4.1) 

i=l 

for all m G M. Construct an / G C(-B) satisfying /(xj) = sgna^, i = 1, . . . , n + 1, ||/|| = 1 and 
II/ — m|| < Such a construction is clearly possible since | (/ — m)(xi) | < |/(xj)| + |m(xj)| < 1+e 
for all i = 1, . . . , n + 1. 

From (4.1) and the fact that /(xj) = sgna i; i = 1, . . . , n + 1, we have that G Pu{f)- Thus 
from the strong uniqueness property of Haar spaces, we necessarily have 

11/ -m||- H/ll > 7 (/)||m|| 

implying 

e = (1 + e) - 1 > ||/ - m|| - ||/|| > 7 (/)||m|| = 7 (/), 

i.e., t(/) < e. Thus 

inf 7 (/) = 0. □ 

f€C(B) ,KJ ' 
11/11 = 1 

While there is no uniform strictly positive lower bound on 7 (/), there are results concerning 
sets in C(B) over which the strong uniqueness constants are uniformly bounded from below by 
a positive constant; see for example Bartelt, Swetits [1983], Marinov [1983], and Bartelt, Swetits 
[1988]. In addition, there are specific bounds on 7 (/) related to the structure of the Af_ m *. To 
explain how these latter bounds arise, we start with a construction. 

As usual, we assume that M is an n-dimensional Haar space in C(B). Given any n + 1 distinct 
point xi, . . . ,x n _|_i G B, there exist a^, all nonzero, such that 

n+l 

^ o.im(xi) = 

i=l 

for all m G M. Normalize the a, so that 

71+ 1 

i=l 

As M is a Haar space of dimension n, there also exist mj G M satisfying 



m j {x i ) = sgnaj, i = 1, . . . ,n + 1, i ^ j. 
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Thus for each j £ {1, . . . , n + 1}, 

n+l n+l 



= &i m j{Xi) = a j m j( x j) = (1 — \ a j\) + a j m j(, X j)i 

i=l i=i 

and therefore 

[sgn a,j]mj(xj) = 1 — -j — r < 0. 

These rrij, j = 1, . . . ,n + 1, will play a special role in determining and bounding 7(/). The nij 
were introduced in Cline [1973]. The important property of the nij is the following. 

Proposition 4.9. Assume M is an n-dimensional Haar subspace ofC(B). Given any n+l distinct 
points {xil^j] 1 in B, let the {ai}™^ 1 and {mj}™=l be as constructed above. Then 

min max [sgn aAmixi) = min -. 

meM i=l,...,n+l L j = l,...,n+l m, 

||m||=l 11 J " 

Proof: We first prove that any n of the above {m.j}™=l are linearly independent, i.e., span M. 
Assume not, and without loss of generality let us assume that ni2, ■ ■ ■ , m n+ i are linearly dependent. 
Thus 

n+l 



^ b k m k = 



k=2 

for some nontrivial choice of 62, ... , 6 n +i- At x\, we have mk{x\) = sgnai for all k = 2, . . . , n + 1. 
Thus 

n+l 

£ 6 fc = 0. 

fc=2 

At Xj, j / 1, we have 

n+l n+l 



= [sgn a,] ^ b k m k (xj) = sgn «//>;///,•(.(•,• ! + ^ 6 fc 

As |aj| > 0, we have bj = for all j 7^ 1, a contradiction. 
Let m £ M be normalized so that 

max [sgnai]m(xi) = 1. 

i=l,...,n+l 

Note that the above maximum is always strictly positive for any m £ M, m 7^ 0, since we have 
Y17=i a i m i x i) = 0) an d m 7^ can on ly vanish at at most n—1 distinct points. Assume 



[sgn a r ]m(x r ) = 1. 
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At x r , we have 



n+l 

i=i 



n+l 



n+l 



1 = [sgn a r ]m(x r ) = a [sgn a r \mj (x r ) = q. 



»=i 



We now prove that c, > for all 7 as above. To this end, note that for j G {1, . . . , n + l}\{r}, 



n+l 



n+l 



1 > [sgn a j]m(xj) = [sgnaj] gmi(xj) = Q + [sgnaj]cj77ij(xj) 



i=l 



i=l 



= (l- c ,) + c,(l- — ) = 1- T ? 7 . 



Thus 



1 > 1 



and as |a 3 -| > 0, this implies that Cj > 0. 



Returning to our definition of m, we have, using the fact that a > and ^7=i Q = 1, 



n+l 



n+l 



m 



ECirriiW < > Cjllmjll < max [Im; 
^— ' i=l,...,n+l 



That is, for every m G M satisfying 



max [sgn ai]m(xi) = 1, 

= l,...,n+l 



we have 



1 1 
> min 



771 j' = l,...,n+l 771 



In addition, we have 



max [sgn ai]mj(xi) = 1 

= l,...,n+l 



for j = 1, . . . , 7i + 1. □ 
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Let / G C(B)\M and m* be the best approximant to / from M. Recall that as M is an 
fi-dimensional Haar subspace of C(-B), there exist n + 1 distinct point a?i, . . . , £C n _|_i G Af_ m * and 
strictly positive Ai, . . . , A n+ i, with Y^i=i ^« = 1> satisfying 



n+1 



^ Ai[sgn(/ - m*)(xi)]m(xi) = 



i=l 

for all m 6 M. In what follows, we let the G M, j = 1, . . . , n + 1, be as constructed above, with 
respect to these x\, . . . , x n+ \ and aj = Aj[sgn(/ — m*)(xj)], i = 1, . . . , n + 1. From Proposition 4.9, 
we immediately obtain the following results. 

Theorem 4.10. Assume M is an n-dimensional Haar subspace of C(B). Let f G C(B)\M and 
m* be the best approximant to f from M. Let the mj, j = 1, . . . , n + 1, be as constructed above. 
Then 

7(/) > . miri 



j = l,...,n+l 1 1 771 j | 



Furthermore, if Af_ m * = {x\, . . . , x n+ i} then 



l(f) = . _ min 



j = l,...,n+l ||777j|| 

Proof: We have that 

7(/) = min max sgn(/ — m*)(x)m(x) 

||m|| = l ^ 

> min max sgn(/ — m*){xi)m{xi 

m£M i = l ... n+1 
||m|| = l 

From Proposition 4.9, we see that this latter quantity equals 

1 

min -. 



j = l,...,n+l \\m,j\\ 

This proves the first statement of the theorem. The second statement follows analogously since the 
inequality is then an equality. □ 

The first statement in Theorem 4.10 is in Cline [1973]. The second statement follows easily 
from results in Cline [1973] together with the characterization of Theorem 2.2, as noted in Henry, 
Roulier [1978]. According to Blatt [1986], the first inequality was partially implicitly arrived at in 
Freud [1958]. 

If Af_ m * has exactly n + 1 points, then from the above construction each of the rrij is "ad- 
missible", i.e., satisfies 

max [sgn(/ — m*)(x)]mj(x) = 1, j = 1, . . . , n + 1. 

x€A f _ m * 

However, if Af_ m * has more than n + 1 points then it may be that some of the above constructed 
rrij satisfy 

max [sgn(/ — m*)(x)]rrij(x) > 1. 

x€Af_ m * 
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This is why equality need not hold in the first part of Theorem 4.10. 

Given Af_ m * , it is also not clear, a priori, which choices of n + 1 points X\, . . . , x n+ \ in Af_ m * 
can be nade so as to satisfy 

n+l 

^2 Aj[sgn(/ - m*)(x i )]m(x i ) = 

i=l 

for all to G M with Aj > 0, i = 1, . . . , n + 1 (except of course if Af_ m * contains only n + l points). 
However if B is a connected interval then the "eligible" x\ < ■ ■ ■ < x n+ \ are exactly those for which 
[sgn(/ — m*)(xi)][sgn(/ — m*)(xi + i)\ < 0, i = 1, . . . , n, (see Theorem 4.4). 

In Bartelt, Henry [1980], there is an example of a function / for which 7(/) is strictly greater 
than min^i^.^+i l/||mj|| when one varies over all possible "eligible" {xi}™^ (see the previous 
paragraph). It is the following. 

Example. Let M = Hi be the polynomials of degree < 1 in C[0, 1]. Let / G C[0, 1] be piecewise 
linear with nodes /(0) = /(1/2) = /(l) = 1 and /(1/4) = /(3/4) = -1. It can be readily 
verified that 7(/) = 3/5 while the maximum of mmj = i t _^ n+ i l/||mj|| where we choose 3 arbitrary 
alternating points is 1/5 (attained choosing the points 1/4, 1/2,3/4). 

A simple consequence of Theorem 4.10 is the following, as was noted in Blatt [1984b]. 

Corollary 4.11. Assume M is an n-dimensional Haar subspace of C(B). Let f G C(B)\M and 
m* be the best approximant to f from M. Assume Af_ m * = {x\, . . . ,x n+ \}. Then 

id) < ±. 

Proof: From Theorem 4.10, we have 

7(/) = min -. 

j = l,...,n+l mJ 



From the construction of the nij 



Since > and Y17=i ^ = 1' there exists a Aj satisfying Aj < l/(n + 1). Thus, for this j, 

\\ mj \\ > \mj(Xj)\ = - 1 > n, 
A i 

implying j(f) < 1/n. □ 

This bound of 1/n is independent of both B and M. There is no reason to assume that it is 
attainable. However Cline [1973] proved that if B = [—1, 1], M = TL n -i = spanjl, x, . . . , x n_1 }, and 
the {xi}™=i are the extreme points of the Chebyshev polynomial of degree n, then 7(/) = l/(2n— 1). 
So at least in these cases the order 0(n) is attainable. 

If M = n n _i and B = [a,b], then it is well known that if / G C n [a,b] and f( n \x) is strictly 
of one sign for all x G [a,b], then Aj_ m * has exactly n + l points. As such, it follows from 
Corollary 4.11 that the strong unicity constant associated with, for example, e x necessarily tends 
to as n t oo. It has been shown by Gehlen [1999] that if / is not a polynomial, then necessarily 
liminf 7 ra (/ ) = 0, where 7 n (/) is the strong unicity constant for / on [a,b] with respect to the 



\ m j( x j)\ = -r- ~ !■ 
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polynomial approximating space H n -\. This was formally conjectured in Henry, Roulier [1978] 
and is sometimes called the Poreda conjecture; see Poreda [1976]. A lot of work went into this 
conjecture until it was finally proven by Gehlen. It should be noted that the above liminf cannot 
be replaced by the simple limit. There are functions / G C[a, b] for which 

liminf j n (f) = 0, limsup7 n (/) = 1; 

n->oo n->oo 

see Schmidt [1978]. From this example, we also see that 7 n (/) is not, in general, a monotone 
function of n. Nothing is known concerning the behavior of the associated 7„(/) on other dense 
subspaces. 

If Af_ m * contains more than n + 1 points, then via the ||mj||, we cannot necessarily calculate 
the exact value of 7(/). It is therefore natural to ask how we can calculate 7(/). The following, 
from Schmidt [1980], characterizes the elements of M that attain the minimum in the formula for 
the determination of 7(/). 

Theorem 4.12. Assume M is an n-dimensional Haar subspace of C(B). Let f G C{B)\M and 
m* be the best approximant to f from M. Assume m G M, \\rh\\ = 1, satisfies 

7(/) = ma x [sgn(/ - m*)(x)]rh(x). 

x£A f _ m * 

Then given any x* G B satisfying 

|m(x*)| = ||m||, 

there exist points x\,...,x n G Af_ m * and strictly positive values Ai, . . . , X n +i such that 

[Sgn(/ - 777,*)(Xi)]777(Xi) = -f(f), 1 = 1, . . . , 77, 

and 

n 

^ Ai[sgn(/ - 777*)(xi)]777(xi) - \ n+1 [sgnfh(x*)]m(x*) = 

i=l 

for all 777 G M. 

Proof: There necessarily exists an 777 G M, \\m\\ = 1, satisfying 

7(/) = max [sgn(/ — 777*)(x)]777(x). 

x£A f _ m * 

Set 

D := {x : x G A f _ m *, [sgn(/ - 777*)(x)]t77(x) = 7 (/)} 

and let x* be as above, i.e., |m(x*)| = ||m||. We first claim that there does not exist an m G M 
satisfying 

[sgn(/ — 777*)(x)]777,(x) < 0, x G D, 

and 

777(x*)777(x*) > 0. 

For if such an m existed, then for e > sufficiently small 777 e = 777 + em would satisfy 

[Sgn(/ - 777*)(x)]777 £ (x) < X G A f _ m * 
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and 

||m £ || > ||m|| = 1, 

contradicting the fact that 

j(f) = min max sgn(/ — m*)(x)m(x) = max sgn(/ — m*)(x)m(x). 

m€M x€ A f _ m * x€A f _ m * 
||m|| = l 

Let mi, ... , m n be any basis for M. For each x G D, set 

M(x) = ([sgn(/ — m*)(x)]mi(x), . . . , [sgn(/ — m*)(x)]m n (x)) G I", 

and let 

N(x*) = (— [sgnm(x*)]mi(x), . . . , — [sgnm(x*)]m n (a;)). 
As M is a linear subspace and there exists no m G M such that 

[sgn(/ — m*)(x)]m(x) < 0, x G D 

and 

— [sgnm(x*)]m(a;*) < 0, 

it follows that G MP is in the convex hull of {M(x)} x€ e> L)N(x*). From Caratheodory's Theorem, 
this implies that is a convex combination of at most n + 1 points in this set. If these k points, 
1 < k < n + 1, do not include N(x*), then there exist Aj > 0, J2i=i = 1> suc ^ that 



^2 Ai[sgn(/ - m*)(xj)]m(a;j) = 



i=l 

for all m G M. Substituting m = m, and since [sgn(/ — m*)(xj)]m(xj) = 7(/) > for i = 1, . . . , k, 
we get a contradiction. Thus there exist Aj > 0, Yli=i = 1> ^fc > 0) sucn that 

fe-i 

^ Ai[sgn(/ - m*)(x i )]m(x i ) - X k [sgnm(x*)]m(x*) = 

i=l 

for all m G M. In fact, since M is a Haar space of dimension n, we must have k = n+ 1 and Aj > 
for alii = 1, . . . , n + 1. □ 

Example. Let us return to the example presented just before Corollary 4.11, namely let M = Yl\ 
be the polynomials of degree < 1 in C[0, 1]. Let / G C[0, 1] be piecewise linear with nodes 
/(0) = /(1/2) = /(l) = 1 and /(1/4) = /(3/4) = -1. It can be readily verified that 7 (/) = 3/5 
while the maximum of mm^i^.^+i l/||mj|| where we choose 3 alternating points is 1/5 (attained 
choosing the points 1/4, 1/2, 3/4). The fh that gives this 7(/) = 3/5 is obtained by considering the 
linear function fh satisfying m(l/4) = —3/5 and m(l) = 3/5, i.e., rh(x) = (8x — 5)/5. 

Remark. In the case where B is well-ordered and M is a T-system thereon, then the x\ < ■ ■ ■ < x n 
as in the statement of Theorem 4.12 are such that either f — m* strictly alternates thereon, in which 
case the fh takes its norm at x* < x\ with sgnm(x*) = sgn(/ — m*)(xi) or fh takes its norm at 
x* > x n with sgnm(x*) = sgn(/ — m*)(x n ), or f — m* strictly alternates thereon except for having 
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the same sign at X\ and for some i G {1, . . . , n} and m takes its norm at Xi < x* < Xi + \ with 
sgnm(x*) = sgn(/ - m*)(xi) = sgn(/ - m*)(x i+1 ). 

For which subspaces of C(B) are uniqueness and strong uniqueness equivalent? That is, which 
subspaces M have the property that if / G C(B) has a unique best approximant from M, then 
it is also a strongly unique best approximant? We know that they are equivalent if M is a finite- 
dimensional Haar space since in this case, we always have both uniqueness and strong uniqueness of 
the best approximant. In McLaughlin, Somers [1975], it is proved that if B is an interval [a, b], then 
uniqueness and strong uniqueness for all functions in the space are equivalent if and only if M is a 
Haar space, i.e., if M is not a Haar space on C[a, b] then there exist functions with a unique best 
approximant, but not a strongly unique best approximant. The proof is difficult and complicated, 
and is restricted to the case B = [a,b]. 

If B is a nonconnected set, there may exist subspaces M of C(B) such that uniqueness and 
strong uniqueness are equivalent, and yet M is not a Haar space. As an example, consider M = 
span{(l,0)} in M 2 . Then for / = (x,y), we can have / G M, i.e., y = 0, in which case we trivially 
have both uniqueness and strong uniqueness of the best approximant. On the other hand, if / ^ M, 
i.e., y / 0, then / never has a unique best approximant. The vectors c(l,0) are best approximants 
to (x, y) from M for all c G [x — \y\, x + \y\ ] as for all such c, we have 

HO, y) - c(l, 0) || = max{|x - c|, \y\} = \y\. 

Thus uniqueness and strong uniqueness hold simultaneously or do not hold at all. A more general 
result is the following 

Proposition 4.13. Assume B is a compact HausdorfF space and M is an n-dimensional subspace 
of C(B). Assume B = B\ U ■ ■ ■ U B^ where each Bj is both open and closed, M\s j = Mj is a Haar 
space of dimension nj (assuming nj > 1) on Bj , and 

k 

Y^nj =n. 

j=i 

That is, we assume Mj (if nj > I) has a basis of functions that vanish off Bj. Then uniqueness 
and strong uniqueness are equivalent on C(B). 

Proof: From the above assumptions, we can write 

m = Mi e m 2 e • • • e M k 

and each m G M has a unique decomposition of the form 

k 

m = m ] 
i=i 

where rrij G Mj (and rrij vanishes off Bj). Thus 

k 

\\f -m\\ B = \\f -^2mj\\ B = max ||/-m|| B . = max \\f - mj \\ Bj ■ 
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We first claim that / has a unique best approximant from M if and only if 

min||/-m|| B = min ||/-m,-|| B , 

for each j G {1, . . . , k} for which rij > 1. Assume m* G M is a best approximant to / from M, and 
there exists an £ £ {1, . . . , k} for which ng > 1 and 

min ||/ - m\\ B > min ||/-m^|| B£ . 

mgM rni^Mt 

Let G M^ be the best approximant to / from Mi on Be. As nt > 1, there exists an G M^, 
fhe ^ 0, and vanishes off B^. Thus 

min ||/ - m\\ B > \\f - (m* e + em e )\\ Be 

for all e sufficiently small, implying that m* + erht are best approximants to / from M for all e 
sufficiently small. Thus, if / has a unique best approximant from M then 

min||/-m|| B = min ||/ - m,-|| B 

for each j G {1, . . . , k} with rij > 1. 
Let us now assume that 

min ||/ - m|| B = min ||/ - mj \\ Bj 

for each j G {1, ...,&} for which rij > 1. We claim that in this case, we always have strong 
uniqueness that also, of course, implies uniqueness. Let m* G Mj be the unique best approximant 

to / from Mj on Bj (that vanishes off Bj). Set m* := X^=i m j- Thus m* is a best approximant 
to / from M and 

\\f-m*\\ B = \\f-m*\\ Bj 

for all j with rij > 1. As Mj is a Haar space (assuming rij > 1) on Bj, there exists a 7j(/) > for 
which 

11/ - mj\\ Bj - ||/ - m*|| B . > 7j(/)ll"ij - "iJHbj 

for all mj G Mj. Set 

7(/) := j= min ifc 7i(/)- 

Let m G M and m = Sj=i 771 j where rrij G Mj. For each j G {1, . . . , k} with rij > 1, we have, 
since ||/-m*|| B = \\f-m*\\ Bj , 

11/ -ti||b - ||/-m*|| B > \\f -mj\\ Bj -\\f -m*\\ Bj 

> ljU)\\ m j - m *j\\B, 

> 7(/)||m,-m*|| Br 

If rij =0, then mj = m* = and thus 



11/ - m|| B - ||/ - m*|| B > = 7(/)ll r "i ~ ™*j 
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As we now have 



||/-m|| B - ||/-m*|| B > 7(/)ll"*j -m*\\ Bj 



for all j = 1, . . . , k, this implies that 



11/ - m\\ B - ||/ - m*\\ B > l(f)\\m - m*\\ B . 



□ 



It would be interesting to determine exact necessary and sufficient conditions for when unique- 
ness and strong uniqueness are equivalent on C{B). 

Unfortunately, the conditions of Proposition 4.13 are not necessary. For example, consider the 
space M = span{mi,m2} onB= [0, 1] U [2, 3] where 



In this example, uniqueness and strong uniqueness are equivalent, but this example does not satisfy 
the conditions of Proposition 4.13. Uniqueness and strong uniqueness are equivalent because for 
uniqueness of the best approximant from M to hold it is necessary that the error function attains 
its norm alternately at least three times in [0, 1] U [2,3], and two of these alternants must be in 
[0, 1]. Details are left to the reader. But in these cases, we also get strong uniqueness. This latter 
fact follows from Theorem 2.4. 

For dimM = 1, we have the following result. The argument is from McLaughlin, Somers 



Proposition 4.14. Assume B is a compact Hausdorff space and M = spanjm} is a 1-dimensional 
subspace of C(B). Assume uniqueness and strong uniqueness from M are equivalent on C(B). 
Then either M is a Haar space on B, i.e., m vanishes nowhere on B, or B = Bi U B 2 where Bi,B 2 
are both open and closed, and m vanishes nowhere on B 1 but vanishes identically on B 2 . 

Proof: To prove the proposition, it suffices to prove the following. If xq G B is such that m(x ) = 0, 
then m vanishes identically in a neighbourhood of xq. Assume this is not the case. That is, there 
exists an xo £ B such that m(xo) = and m does not vanish identically in any neighbourhood of xq. 
Then, we can choose some xq such that m(xo) = and there exists a small, closed, nondegenerate 
set N such that xq G dN and em(x) > for all x G A^\{xo} for some e G { — 1, 1}. We assume, 
taking cm if necessary for some c G M, that e = 1 and ||m|| = 1. Let (x n ) be a sequence of 
points in iV converging to Xq. Thus m(x n ) > for all n, and m(x n ) converges to zero. Let 
y G B\N with m(y) / 0. We define / G C(B) as follows. First set f(y) = — sgnm(y) and 
f{x n ) = 1 — m 2 (x n ). Note that f(x ) = 1. Extend / to be in C(B) satisfying ||/|| = 1 and 



As ll/H = 1 and f(x ) = 1, it follows that the function is a best approximant to / from M. 
From the construction of /, we have Aj = {xo,y}. A simple calculation gives 




x G [0, 1] 
x G [2, 3] 



and 




x G [0, 1] 
x G [2, 3] ' 



[1975]. 



< 1 for all x G B\{x ,y}. 



7(/) = minmax{±m(xo), T(sgnm(y))m(y)} = 0. 



Thus, we do not have strong uniqueness of the best approximant to / from M. It remains to prove 
we have uniqueness. 
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For a > 0, 

\f(y) - am(y)\ = | - sgnm(y) - a(sgnm(y))|m(y)|| = 1 + a\m(y)\ > 1, 

implying am is not a best approximant for a > 0. Assume a < is such that am is a best 
approximant. Thus for each x n , we have 

\f(x n ) - am(x n )\ = |1 - m 2 (x n ) - am(x n )\ < 1. 

This implies that we must have 

for all n. But a < and m(x n ) > converges to zero, which is impossible. Thus the function is 
the unique best approximant to / from M. □ 



5. Classical Strong Uniqueness in the L 1 Norm 

In this section, as the title indicates, we will study strong uniqueness when our approximating 
norm is L 1 . To this end, let us assume that K is a set, S a cr-field of subsets of K and v a positive 
measure on S. By L X (K, v), we mean the set of real- valued functions / that are ^-measurable, and 
such that | /| is integrable. The norm on L l (K, u) is given by 



|/(aOI<M*). 

K 

Two important sets when approximating in this L l (K, v) norm are the following. For each 
/ G L l (K, v), we define its zero set as 

Z(f) = {x : f(x) = 0} 

and also 

N(f) = K\Z(f). 

Note that Z(f) is z/-measurable. 

To understand when classical strong uniqueness might hold, we first calculate the r+(f,g) of 
(1.1). 

Theorem 5.1. For given f,g £ L l (K, v), we have 

T+(f,g)= / \g(x)\du(x) + / [sgnf(x)]g(x)du(x). 
Jz(f) Jk 

Proof: Let us assume that f,g^0. By definition, 

r+(/ ,,)_ Um J!i±MiH!Z!i. 
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For t > 0, 



K 



(\f + tg\-\f\)dv 



[ t\g\dv+ [ (\f + tg\-\f\)du 

JZ(f) JN(f) 



\9\^ + \ 



On N(f), 



and 



z{f) 

1/ + tg\-\f\ 



(\f + tg\-\f\)du 



N(f) 



<\g\ 



\f + tg\-\f\ \f + tg\ 2 -\ff 



t[\f + tg\ + \f\] 
Vg + t\g\ 2 



Thus on N(f), 



lim 



\f + tg\ + \f\ 

\f + tg\-\f\ 2fg 



sgnf]g. 



t 2|/| 

Applying Lebesgue's Dominated Convergence Theorem, we obtain 



(f,g) = \g\du+ [sgn/]# 
Jz(f) Jk 



dzA □ 



As a consequence of Theorem 5.1, Theorem 1.2 and Theorem 1.4, we have: 

Theorem 5.2. Let M be a linear subspace of L l (K, u) and f G L l {K, u)\M. Then m* G PmU) 
if and only if 



[sgn(/ — m*){x)\m{x) dv(x] 



K 



Z(f-m-) 



\m(x)\ dv{x) 



for all m £ M. Furthermore, m* £ M is a strongly unique best approximant to f from M if and 
only if 

n f(f)= inf < / \m{x)\dv(x) + / [sgn(/ — m*)(x)]m(x) dv(x) > > 0. 

||™fi=i {JzU-m*) JK J 

The first statement in this theorem can be found in James [1947, p. 291]. 

Recall (see Proposition 1.6) that if M is a finite-dimensional subspace, then the infimum in 
the definition of 7(/) is a minimum, i.e., it is attained. To prove strong uniqueness, it therefore 
suffices to verify that 

/ \m(x)\dv(x) + I [sgn(/ — m*){x)]m{x) dv(x) > 

JZ(f-m') Jk 
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for each m G M, \\m\\ = 1, and that is equivalent to showing that 



/ [sgn(/ — m*)(x)]m(x) &v(x) < / \m(x)\du(x) 

Jk JZ(f-m') 

for all m G M, m / 0. 

We consider two scenarios. In the first case, v is a discrete positive measure with a finite 
number of points of support, and in the second case v is a non-atomic positive measure. The two 
cases radically differ. 

Assuming v is a discrete positive measure with a finite number of points of support, we are 
effectively considering approximation in the £j™(w) norm where 



^T(w) : = l x = {xi,...,x m ) : x G 



x 



X] \ X i\ W i} 



i=l 



and w = (wi, . . . ,w m ) is a fixed strictly positive vector (weight). In this case, it transpires that 
uniqueness and strong uniqueness are equivalent. It is also true that for a fixed weight w most 
subspaces are unicity spaces, i.e., for most subspaces M there is a unique best approximant to each 
vector x from M. However for some subspaces, we do not have uniqueness of the best approximant. 
For example, in t\ with weight (1,1), i.e., norm ||x|| w = |xi| + \x2\1 and M = span{(l,l)} then 
to each and every x £ M, there is no unique best approximant. On the other hand, all the 
1-dimensional subspaces in i\ other than M = span{(l, ±1)}, are unicity spaces. 

The equivalence of uniqueness and strong uniqueness in this setting is a consequence of the 
following, that is essentially contained in Rivlin [1969, Theorem 3.6]; see also Watson [1980, p. 122] 
and Angelos, Schmidt [1983]. 

Theorem 5.3. Let M be a finite-dimensional subspace of ^™(w) and x G M. m . Then m* G M is 
the unique best approximant to x from M if and only if 



^[sgn(xi - m*)]m i w 1 



i=i 



< ^2 \ m i\ W i 
ieZ(x-ra*) 



for all m G M, m / 0. Thus every unique best approximant to x from M is also a strongly unique 
best approximant to x from M. 

Proof: Assume m* G M is the unique best approximant to x from M. Thus, by the characteriza- 
tion of the best approximant, we have 



^[sgn(xi - m*)]miW. 

i=l 

for all m G M. Assume m G M, m / 0, is such that 

m 

^[sgn(xi - m*))rhiW. 



< ^ \ m i\ W i 
ieZ(x-ra*) 



i = l 



^2 \ m i\ W i- 
ieZ(x-m*) 



We shall prove that the uniqueness of the best approximant implies that such a m cannot exist. 
From Theorem 5.2, this implies the equivalence of uniqueness and strong uniqueness. 
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Since we are in the finite-dimensional subspace R m , 

min{|xi — m*\ : i ^ Z(x — m*)} = c > 0. 
Thus there exists an e > such that for all t satisfying \t\ < e, we have 

sgn(xi - ml) = sgn(xi - m* - trhi) 
for each i £ Z(x — m*). For each such t, 



x — m — tm I 



x — m 



= ^ ~ m i ~ tm i\ ~ \ X i ~ m *i I) W i 

i=l 

m 

= ^[sgn(xi - m*)](xi - m* - trhi ~ %i + m*) 



i=l 

+ \ tm i 
ieZ(x-m*) 



= -ty~^[sgn(xj - m*)]miWi + |t| ^ \rrii\wi. 

ieZ(x-m') 



i=l 



By assumption, 



Thus 



^[sgn(xi - m*)]miW, 



x — m — tm 



i=l 



x — m 



^ |miK- 

ieZ(x-m*) 



-t^[sgn(xi - m|)]mjiyj + \t\ 



i=i 



^[sgn(xi - m*)]miW, 



i=i 



x — m 



x — m 



If i[sgn(xi — m*)]fhiWi = 0, then for all such t, i.e., \t\ < e, we have 

||x — m* — tm|| w 
If S (Y™ = l [sgn(xj - m*)}fhiWi) > 0, 6 G {-1, 1}, then 

||x — m* — tm|| w 

for all t satisfying < tS < e. In either case, we have nonuniqueness of the best approximant. □ 

In the second case, we assume that v is a non-atomic positive measure. In this case, neither 
strong uniqueness (nor uniqueness) is always present, but it is nevertheless around. This next result 
can be found in Angelos, Schmidt [1983]. 

Theorem 5.4. Let v be a non-atomic positive measure. Let M be a Unite-dimensional subspace 
of L l (K, v). Then the set of f G L (K, v) that have a strongly unique best approximant from M 
is dense in L l {K, v). 

Before proving this result, we recall another characterization of the best approximant via linear 
functional that, since v is a non-atomic positive measure, has the following form. This result was 
first proved in Phelps [1966]. 
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Theorem 5.5. Let v be a non-atomic positive measure. Let M be a finite-dimensional subspace 
of L x {K,v) and f G L l (K, v). Then m* G P M (f) if and only if there exists an h G L°°{K,v) 
satisfying 

(i) \h{x)\ = 1, all x G K; 



(ii) / hmdv = 0, all m G M; 

J K 

(iii) / h{f-m*)dv=\\f-m*\\ 1 . 

J K 



Proof of Theorem 5.4: Let / G L x {K,v) and assume that / does not have a strongly unique 
best approximant from M. For convenience, we translate / by an element of Pm(I) so that we may 
assume, without loss of generality, that G Pm(/)- By Theorem 5.5, there exists an h G L°°(K, v) 
satisfying 

(i) \h{x)\ = 1, all x £ K; 



(ii) / hmdu = 0, all m G M; 

(iii) / /i/di/=||/||i. 



Note that conditions (i) and (iii) imply that h = sgn/ z/-a.e. on N(f). 

Given e > 0, it follows from the fact that v is a non- atomic positive measure and the absolute 
continuity of integrals that there exists a 5 > such that ii C Q K with u(C) < 5, then 



L 



\f\du<e. 



Let C C K, v(C) < 5, be such that if hm < on C, then m = 0. (Such a (7 can be found; see 
Pinkus [1989, p. 22].) Set 

' /, off C 
0, on C. 



fs---- 



Then 



II/- /.Hi = / l/|dz^<e 
Jc 



Since sgn / = sgn/ £ off C, it follows that for the above h, we also have 
(iii') / A: /i/ e di/=||/ e || 1 . 
Thus G P M (f e ). 

As h = sgn / = sgn/ e on N(f £ ) C N(f), we have from (ii) 

= / hmdu= / [sgn/ £ ]mdz^+ / hmdv + j hmdv 

JK JNUe) JC Jz(fe)\C 



for all m G M. If 



for some m G M, then we must have 



[sgn / e ]m dz^ 



\m\ dv 



I hmdv + / hmdv 
Jc Jz(f e )\c 



/ |m|dz/+ / 
Jc Jz 



\m\ dv 



z(h)\c 
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implying that hm is of one sign on C. However this in turn implies that m = 0. Thus 

/ [sgn/ e ]moV < / \m\ du 
JN(f,) Jz(fr) 



for all m G M, ra/0, and the zero function is therefore the strongly unique best approximant to 
f £ from M. □ 

In Smarzewski [1988], it is proven, in the general L l (K,v) space without any assumptions on 
the measure v, that those elements with a strongly unique best approximant are dense in the set 
of elements with a unique best approximant. His proof in this case parallels his proof of Theorem 
2.5. This proof immediately implies Theorem 5.3. It does not directly imply Theorem 5.4. For v 
a non-atomic positive measure, the density in L 1 (K, v) of the set of functions that have a unique 
best approximant from M goes back to Havinson, Romanova [1972] and Rozema [1974]. 

In Theorem 3.1, we proved that strong uniqueness at / implies that the best approximation 
operator from M is locally Lipschitz continuous at /, i.e., if / has a unique best approximant from 
M and 

||/- m ||-||/-P M (/)||> 7 || m -P M (/)|| 
for all m G M and some 7 > 0, then for each g G X and any element of Pm^)-, we have 

||Pm(/)-Pm(s)||<^||/-<?||. 

This implies that in the inequality 

\\PmU)-Pm{s)\\<^\\S-91 
we can always take the minimal (optimal) cr(f) therein to satisfy 

2 



*(/)< 



1(f) 



where j(f) is the strong uniqueness constant for /. In Theorem 3.2, we proved that on C(B), 
assuming M is a finite-dimensional subspace, local Lipschitz continuity at / of the best approxi- 
mation operator and strong uniqueness at / are equivalent. But other than the above inequality, 
the relationship between <r(/) and j(f) is unclear. 

In the case of L X {K, v), we again have that local Lipschitz continuity at / of the best approx- 
imation operator from M and strong uniqueness at / are equivalent. Full details may be found 
in Angelos, Kroo [1986]. We consider here the particular case of where v is a non-atomic positive 
measure. In this case, despite the positive density statement of Theorem 5.4, for many /, we do 
not have strong uniqueness. But when we do have strong uniqueness then we also have 



Theorem 5.6. Let v be a non-atomic positive measure. Let M be a finite-dimensional subspace 
of L X {K, u). Then f G L X (K, v) has a strongly unique best approximant from M if and only if the 



Strong Uniqueness 



41 



best approximation operator from M is locally Lipschitz continuous at f. Furthermore in this case, 
we have 

where j(f) and <r(f) are as detailed above. 

Proof: If / has a strongly unique best approximant from M, then we have from Theorem 3.1 that 
the best approximation operator from M is locally Lipschitz continuous at /, and <j{f) < 2/j(f). 
Assume the converse, i.e., the best approximation operator from M is locally Lipschitz continuous 
at /. We will prove that / has a strongly unique best approximant from M and l/j(f) < <?{f). 

We assume that / ^ M and the best approximant to / is unique. If either of these assumptions 
does not hold, then the result follows easily. For ease of exposition, we assume that G Pm(/)- 
Let m G M, and assume 

11/ - m||i -||/||i= S >0. 

Set 

B:={x: f(x)(f-m)(x) > 0} 

and 

A:=K\(BUZ(f)). 

Since v is a non-atomic positive measure, Theorem 5.5 is valid. Let h G L°°(K, v) be as therein. 
Define g G L l {K, v) by 

{m(x), x G A 

f(x), xeB 
\m(x)\h(x) + m{x), x G Z(f). 

We shall prove that m G Pm{q) and ||/ — <?||i < 5 from which the results will follow. 
We note that as \h\ = 1 on K then 

/ h(g — m)du = \\g — m\\i 
Jk 

if we can prove that h(g — m) > 0. Now on A, we have h(g — m) = 0. On B, we have h{g — m) = 
h{f — m ) = |/ — m\, since h = sgn/ on N(f), and B C N(f) is where sgn/ = sgn(/ — m). On 
Z(f ), we have h(g — m) = \m\. Thus h(g — m) > implying that m G Pm{9)- From Theorem 5.5, 
it therefore follows that m G -Pm(s')- 
By the above, we have 

$ = 11/ -mil! -11/11!= / \f-m\dv- [ \f\dv 
= / \f — m\di>+ \m\ dv — \ \f\dv 

J AUB Jz(f) J AUB 

= / (/ — m) sgn(/ — m) dv — \ (f — m) sgn / di^ + / \m\dv — I m sgn f dv 

J AUB JAUB Jz(f) J AUB 

= / (/ — 777)[sgn(/ — m) — sgn/] dz^ + / \m\dv + I hmdv, 
Jaub Jz(f) Jztf) 
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Now 



since J K hmdv = for all m G M and h = sgn/ on A U B. On B, we have sgn(/ — m) = sgn/, 
while on A, we have sgn(/ — m) / sgn/. Thus 

5 = 2 / |/ — m|chv+ / \m\ + hmdv. 
J A Jz(f) 

= / \f-m\du+ / \f-f\du + / \\m\h + m\du 
J a Jb Jz(f) 

= \ \f — m\dv+j \m\+hmdv 

J A Jz(f) 

< 2 / \f — m\dv+ I \m\ + hmdv = S. 
J a Jzu) 

As the best approximation operator from M is locally Lipschitz continuous at /, then 

\\PMU)-P M (9)h<°U)\\f-9h 

for all g G L x (K,v*). Substituting the above g, where we recall that G Pm(/), m G Pm{9) and 
11/ - 3 ||i <8=\\f- m||i - (I/Hi, we have 

IHIi = II^m(/) - Pm( 5 )IIi < *{f)\\f - slli < - mil! - \\fU 

||m|| 1 < ( 7(/)[||/-m|| 1 -||/|| 1 ] 
for every m G M. This implies that / has a strongly unique best approximant from M, and 

' < *(/)■ □ 



7(/) 

We now consider strong uniqueness in the one-sided L 1 (-ftT) case. Let K be a compact set in 
° d satisfying if = intif. We consider / G C(if) with norm 

|d/x 



/ l/l- 

JK 



where fx is a non-atomic positive finite measure with the property that every real- valued continuous 
function is ^-measurable, and such that if / G C(K) satisfies ||/||i = then / = 0, i.e., || • ||i is 
truly a norm on C(K). For each / G C(K), set 

M(f) := {m : m G M,m < /}, 

where M is a finite-dimensional subspace of C(K). We consider the problem 

inf II/ — m||i = inf / If — m\du = inf / f — mda, 

meM(f) rn€M(f)J K rn€M(f)J K 

since / — m > for all m G M(f). This is equivalent to considering 

sup / rnd/j. 

m€M(f)JK 

We let -Pm(/)(/) denote the set of one-sided best approximants to / from M. That is, 
PmU)U) ■= {™* ■ ™* e M(/), ||/ - m*||i < ||/ - mid, all m G M(/)}. 
The following is one characterization of best one-sided ^-approximations. 
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Theorem 5.7. Let f G C(K). Assume M is an n-dimensional subspace of C(K) containing 
a strictly positive function. Then a one-sided best L 1 -approximation to f from M exists and 
m* G PM(f)(f) if an d only if there exist distinct points {xj}^ =1 in K, 1 < k < n, and positive 
numbers {Ai}^ =1 for which 

(a) (/ - m*)(xi) = 0, i = l, ...,k. 

(b) For all m £ M 



/ m 

J K 



dfx = ^ Xjm(xi). 



i=l 



We have the following result whose conditions are, unfortunately, difficult to verify and gener- 
ally rarely hold. 

Theorem 5.8. Let M be an n-dimensional subspace of C{K) and f G C(K). Assume there exist 
distinct points in K, and an m* G M(f) satisfying 

(a) (/ - m*)(xi) = 0, i = l,...,n. 

(b) IfmeM satisfies m(xi) = 0, i = 1, . . . , n, then m = 0. 

(c) There exist strictly positive values \, % = 1, . . . , n, such that 



,. n 

/ mdfi = y ^ Xjm(3 
Jk i=i 



for all m G M. 

In this case, m* is the unique best approximant to f from M(f) and there exists a 7 > such that 

11/ -m\\i- 11/ - m *\\i > l\\ m ~ m *lli 

for all m G M(f). 

Proof: Assume m G M(f). Then from (c) and (a), we have 

~ n n n ~ 

/ md|i = S2 ^i m i x i) < ^if( x i) = y2 \im*{xi) = \ m*d/i. 
Jk i=i i=i i= i Jk 

If equality holds, then we must have m(xi) = f(xi) = m*(xi), i = 1, . . . ,n. Thus from (b), we have 
m = m* and therefore m* is the unique best approximant to / from M(/). 

Let mi G M satisfy mi(xj) = Sij, i,j = 1, . . . ,n. From (b), it follows that we can construct 
these mi. Thus, we can write each m G M is the form m = Y^7=i fn{xi)mi. Set 

\ 

7 := mm — > 0. 

i=l,...,n ||mj||i 

Thus Aj > 7||mj||i for i = 1, . . . , n. Assume m G M satisfies m{xi) > 0, i = 1, . . . , n. Then, using 
(c), we obtain 



/ m 

Jk 



dfj, = ^2 Ajm(xj) > 7^m(xj)||mj||i > 7|| ^m(x i )m i ||i = 7||m||i. 



i=l i=l i=l 
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Now for each m G M(f), we have 

||/ - m||i - ||/ - m*||i = / (/ - m) - (/ - m*) d/x = / (m*-m)d/i. 

■/if iff 

From (a), and since m G M(f), we have (m* — m)(xi) = (/ — m)(xj) > 0, i = 1, . . . , n. Thus 

(m* — m) dfi > 7||m — m* ||i 



implying 

||/ — m||i — ||/ — m*\\i > j\\m — m*||i. □ 

Both Theorems 5.7 and 5.8 are essentially to be found in Niirnberger [1985]. In Section 9, we 
discuss non-classical strong uniqueness in the L 1 setting. 

6. Strong Uniqueness of Rational Approximation in the Uniform Norm 

In this section, we consider strong uniqueness of the best uniform approximant of functions / G 
C[a, b] by rational functions of the form 

R m ,n ■= {r = p/q : p £ Il m ,q G U n ,q(x) > Q,x G [a, b}}, 

where II n = span{l, x, . . . , x n }. In contrast with the previous approximation problems, we are 
here faced with a nonlinear set of approximants that makes even the question of existence of a 
best approximant a nontrivial problem. The general technique, based on bounded compactness of 
the unit ball in finite-dimensional spaces, is not applicable here. Nevertheless, it turns out that 
existence and uniqueness of the best rational approximant is still valid for arbitrary / G C[a,b]. 
However, there is an essential difference from uniform polynomial approximation: the possibility of 
degeneracy. A rational function r = p/q G R m ,n is said to be degenerate if r = or r G i? m _i 5 „_i. 
This degeneracy of rational approximants does not effect the uniqueness of the best approximant, 
but it does spoil the continuity of the best approximation operator and also the property of strong 
uniqueness. 

To explain all this, let us introduce the following quantity called the defect of the irreducible 
rational function r = p/q G R m ,n- 

M r \ . = / min{m -dp,n- dq}, r / 0; 
\ n, r = 0, 

where dp is the degree of the polynomial p. Clearly, r = p/q G R m ,n is degenerate if and only if 
d(r) > 0. Moreover, d{r) is the greatest number such that r = p/q G R m _d(r),n-d(r)- 

First, we are going to address the questions of existence, characterization and uniqueness of 
the best uniform rational approximant. This next result is due to Walsh [1931]. 

Theorem 6.1. (Existence of Best Rational Approximation) Any f G C[a, b] possesses a best 
approximant from R m ,n- 

Proof: Let = Pk/qu £ Rm,n be such that 

\\f - r k \\ ^ E(f) := inf ||/-r||, k -> oo. 
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Then ||rfc|| < M, k > 0, and we can also assume the normalization ||^|| = l,k > 0, i.e., \\pk\\ < 
M,k > 0. Thus, passing to a subsequence, we may assume that p k — > p G n m , g& — > g G n„, 
A; — >■ oo, g > 0, and ||g|| = 1. Furthermore, the inequality \pk\ < Mgfc, A; > 0, that clearly holds on 
[a, b] implies that \p\ < Mq, on [a, 6]. Thus p must vanish at every zero of q in [a, b] to at least the 
same multiplicity, which yields that r := p/q £ R m ,n and 

r k (x) ->• r(x), x£[a,b]\Z, (6.1) 

where Z is the discrete set of zeros of q. Choose now an arbitrary e > 0. Since [a, b] \ Z is dense 
in [a, 6], the continuity of functions involved yields that for some x G [a, 6] \ Z, we have 

||/-r||< r(x)|+ £ . (6.2) 

Since (6.1) holds for this x, we have for k large enough 

\f(x)-r(x)\ < \f{x)-r k (x)\+e<\\f -r k \\+e<E{f) + 2e. 

Combining the two inequalities, we obtain ||/ — r\\ < E(f) + 3e, i.e., ||/ — r\\ = E(f). Thus r is a 
best approximant to /. □ 

The next step will be to derive an analogue of the Alternation Theorem for rational approxi- 
mation. We start with two auxiliary lemmas. 

Lemma 6.2. Let r G R m ,n be irreducible. Then II m + rn n is a Chebyshev (T-) space of dimension 
m + n + 1 — d(r) on R. 

Proof: Let us first show that the dimension of U m + rll n equals m + n + 1 — d(r). If r = 0, then 
d(r) = n and this becomes trivial. So assume r / 0. Clearly, 

dim(il m + rU n ) = dim(II m ) + dim(rll n ) - dim(II m n rU n ) = m + n + 2 - dim(II m n rU n ). 

Furthermore, if p\ G II m n rli n with r = pjq then piq = pqi for some q\ G Yi n . Since the 
polynomials p, q are relatively prime, it follows that p\ = gp and qi = gq, where g is an arbitrary 
polynomial of degree at most min{m — dp, n — dq} = d(r). Thus dim(II m n rll n ) = d{r) + 1 and 
hence dim(II m + rll n ) = m + n + 1 — d(r). 

To prove the T-space property, assume some p\ + rqi G II m + rll n , pi G II m , qi G II n , r = p/q, 
has m + n + 1 — d(r) distinct zeros on 1R. Then pig + pgi has these same zeros. But evidently 
d(piq + pqi) < m + n — d(r), implying that p\ + rqi = 0. Thus II m + rll n is a T-space. □ 

Lemma 6.3. Let f G C[a, b] and assume that r* G R m ,n is its best approximant, and is irreducible. 
Then for every r G U m + r*U n ,r / 0, we have 

max [sgn(/ - r*)(x)] r(x) > 0. (6.3) 

Proof: Set r* =: p* jq* and consider an arbitrary r = p/q, p G IL m , q G Il n , without the assumption 
that q > on the interval [a, 6]. Then for every t > 0, small, we have 

g* + tq m ' n 
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and 



Clearly 



ii/- r *ii<ii(/-o + (V-^ 

-* p " ~ tp tn + 0(( 2 ) 



q* + tq 

where r\ := p p G (n m + r*H n ). Thus by the last two relations, we have 

\\f-r*+tn\\- \\f-r*\\ >Ct 2 

for t > 0, sufficiently small, and some C G M. This implies that t+(/ — r*,ri) > for every 
ri G ^ (n m + r*n„). Applying Theorem 2.1, we obtain that 

max [sgn(/ — r*)(x)] r(x) > 0, r G II m + r*Tl n . 

x€A f _ r * 

In particular, the last relation yields by Theorem 2.2 that the zero function is a best approximant 
to / from Il m + r*n„. But, by Lemma 6.2, this space is a Chebyshev (T-) space. Thus, in view of 
Theorems 4.5 and 2.2, the strict inequality (6.3) must hold. □ 

We can now verify the Alternation Theorem which is the analogue of Theorem 4.4 for rational 
approximation. It may be found in Achieser [1930]. 

Theorem 6.4. (Characterization of Best Rational Approximation) Let f G C[a, b], and r* G R m ,n 
be irreducible. Then the following statements are equivalent: 

(i) r* is a best approximant to f; 

(ii) the error function f — r* equioscillates on at least m + n + 2 — d(r*) points in [a, b]. 

Proof: (i) =>- (ii). By Lemma 6.3 and Theorem 2.2, the function is a best approximant to / — r* 
from II m + r*LT n . Since, by Lemma 6.2, this space is a T-space of dimension m + n + 1 — d(r*), 
Theorem 4.4 implies that (ii) must hold. 

(ii) =>■ (i). Assume ||/ — r|| < ||/ — r*\\ for some r G R m ,n- Since / — r* equioscillates on at least 
m + n + 2 — d(r*) points in [a, b], it follows that r — r* has at least m + n+1 — d(r*) distinct zeros 
in [a, b\. Setting r* =: p*/q* and r =: p/q, we obtain that the polynomial p*q — pq* has at least 
m + n+1 — d(r*) distinct zeros in [a, b]. But clearly d(p*q — pq*) < max{<9p* + n, dq* + m} < 
m + n — d(r*), a contradiction. Thus r* is a best approximant. □ 

As a byproduct of Theorem 6.4, we obtain that every / G C[a, b] must have a unique best 
approximant from R m , n ', see Achieser [1930] and Achieser [1947]. 

Corollary 6.5. (Uniqueness of Best Rational Approximation) Every f G C[a, b] possesses a unique 
best approximant from R myn . 

Proof: Assume that r* = p* /q* is a best approximant to / and ||/ — r\\ = \\f — r*|| for some 
r = p/q G R m ,n- By the same argument as in the proof of (ii) => (i) in Theorem 6.4, it follows that 
the polynomial g := p*q — pq* of degree < m + n — d(r*) has at least N := m + n + 2 — d(r*) points 
of weak sign change. That is, for some points a < x\ < ■ ■ ■ < xn < b, we have g{xj)g{xj-\) < 
for every 2 < j < N. Since dg < N — 2, the Lagrange Interpolation Formula yields that 

N 1 
^^ ) ^) =0 ' 

3=1 ^ JJ 

where u>(x) := (x — x±) • • • (x — xjv). Since u (xj) also alternate in sign, we must have g(xj) = 0, 
1 < j < N, i.e., <7 = 0, and hence r* = r. □ 
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We now have the necessary prerequisites to verify that strong uniqueness of best rational 
approximation from R myn holds if and only if the best approximant is non-degenerate. Moreover, 
our next theorem shows that the continuity of the best rational approximant operator also fails to 
hold exactly in the case of degeneracy of the best approximant. This theorem combines results of 
Maehly, Witzgall [1960], Cheney, Loeb [1964] and Werner [1964] (with regards to continuity of the 
best approximation operator) and Cheney, Loeb [1964] and Cheney [1966] (with regards to strong 
uniqueness) . 

Theorem 6.6. Let f G C[a, b]\R mtn and let r* = p* /q* G R m ,n be its best rational approximant. 
Then the following statements are equivalent: 

(i) d(r*) = 0; 

(ii) r* is a strongly unique best approximant; 

(Hi) the operator of best rational approximation from R m ,n is continuous at f. 
Proof: (i) =>■ (ii). For an arbitrary r G R m ,n, r ^ r*, set 

7(r) := 11/ - i- 11/ - r - II 



In order to verify strong uniqueness, we need to show that there exists 7 > such that 7(7-) > 7 for 
every r G R m ,n, r / r* . Assume, to the contrary, that there exists a sequence = pt/qk G R m ,n, 
r k 7^ r * , such that j(rk) —> 0. We may assume, without loss of generality, that ||fJfc|| + \\<lk\\ = 1) 
k G N. Passing, if necessary, to a subsequence, it can also be assumed that for some p G n m , 
q G II n , we have pk — > p, qk Q, as k — > 00, uniformly on [a, b]. In particular, ||p|| + \\q\\ = 1. 
Assume, in addition, that ||p*|| + \\q*\\ = 1 (where r* = p*/q*). 
Consider now an arbitrary x G Then 

7(r fc )||r*-r fc || = ||/ - r k \\ - \\f - r* \\ 

> (/ - r k ){x) sgn(/ - r*){x) - (/ - r*){x) sgn(/ - r*){x) 
= (r* -r k )(x)sgn(f -r*)(x). 

Thus using the fact that q k > and < 1, we obtain, for any x G Af_ r *, 

[sgn(/ - r*)(x)](q k r* - p k ){x) < j(r k )\\r* - r k \\q k (x) < 7(r fc )||r* - r fe ||. (6.4) 

Note that since j(r k ) — > 0, we must have ||rfc|| < M, k G N, for some M > 0. (Otherwise ||rfc|| — > 00 
for a subsequence of fc's that in turn would yield that 7(r^) — > 1 for the same subsequence.) This 
means that the right hand side of (6.4) tends to as k — > 00 yielding 

[sgn(/- r*){x)](qr* - p)(x) < 0, x G A f _ r *. 

In view of Lemma 6.3, this implies that qr* = p, i.e., p G II m n r*U n . On the other hand, using 
the fact that d(r*) = 0, we have dim(n m n r*H n ) = d(r*) + 1 = 1 (see the proof of Lemma 6.2 
for details). Since evidently p* is also an element of II m n r*U n , we must have p = cp* for some 
nonzero constant c. But then in view of qr* = p, the relation q = cq* also holds. Recalling the 
conditions ||p|| + \\q\\ = \\p*\\ + = 1, and q, q* > 0, we clearly have c = 1, i.e., p = p* , q = q* . 
Since q* > on [a, b], it follows that for some 5 > the relation q k {x) > S holds for all x G [a, b], 
for k sufficiently large. 
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Furthermore, since inequality (6.3) holds for every r in the finite-dimensional linear space 
n m + r*II n , and because the left hand side of (6.3) is a continuous function of r, we obtain that 
there exists an r\ > such that 

max [sgn(/ - r*)(x)] r(x) > r]\\r\\, r £ H m + r*IL n . 

Thus, in particular, there exists an x* k G Af_ r * for which 

[sgn(/ - r*)(x* k )} (q k r* - p k )(x* k ) > f]\\q k r* - p k \\- 
Using this last inequality together with (6.4) applied with x = x* k , we arrive at 

Vhkr* -Pk\\ < j(r k )\\r* - r k \\ = j{r k )\\ — (q k r* - p k )\\ < ^-^-\\qkr* - Pk\\- 

Qk 

Since r k / r* , we can divide both sides of this inequality by \\q k r* — p k \\ yielding rj < 7 ^ fc - > . But 
this contradicts the assumption that "y(r k ) — > 0. This completes the proof of strong uniqueness. 

(ii) =^ (iii). This implication is trivial since strong uniqueness implies Lipschitz continuity of the 
best approximation operator; see Theorem 3.1. 

(iii) =>• (i). Assume that to the contrary d{r*) > 0. We separate the proof into two cases. 

Case 1. f — r* has less than m + n + 2 points of equioscillation. In this case, / — r* has m + n + 2 — d 
points of equioscillation for some < d < d(r*). Evidently, for any e > 0, there exists an r\ 6 R m ,n 
such that \\r* — ri\\ < e and d(ri) = 0. Set g := f + (ri — r*), and denote by r 2 G R m ,n the best 
approximant to g. Clearly, r 2 / r\ since otherwise the error function g — r 2 would have less than 
m + n + 2 points of equioscillation, in contradiction to the fact that d{r\) = and Theorem 6.4. 
Thus by the uniqueness of best rational approximation (Corollary 6.5), 

\\g - r 2 \\ < \\g - nil = 11/ - r*\\. (6.5) 

Now, the function g — r\ = f — r* has m + n + 2 — d points of equioscillation, which by (6.5) yields 
that ri — r 2 must have at least m + n + 1 — d zeros in [a, b]. Set r\ =: pi/qi, r 2 =: P2/Q2- By 
the above observation that the nonzero polynomial p\q 2 — P2Q1 has at least m + n + 1 — d zeros, it 
follows that <9(pi(/ 2 — P2Q1) >m + n+ l — d. On the other hand, 

9{p\q2 -P2Qi) < max{m + dq 2 , n + dp 2 } <m + n- d(r 2 ). 

Thus, we obtain d(r 2 ) < d — 1. This implies that g — r 2 must have at least m + n + 2 — d(r 2 ) > 
m + n + 3 — d points of equioscillation. Moreover, ||/ — <7|| = \\r* — r\ || < e and hence in view of the 
continuity of the best approximation operator at /, we have that g — r 2 tends uniformly to / — r* 
as e — > 0. Recall that / — r* has exactly m + n + 2 — d points of equioscillation, while g — r 2 has 
at least m + n + 3 — d points of equioscillation. This leads to a contradiction as, in the limit, we 
cannot lose points of equioscillation. 

Case 2. We now assume that f — r* possesses at least m + n + 2 points of equioscillation. We also, 
for simplicity, set [a,b] = [0, 1]. We first consider the subcase |(/ — r*)(0)| < ||/ — r*\\. There then 
exist < a < 1 and 5 > such that 

\(f -r*)(x)\ <a\\f -r*\\, < x < 6. (6.6) 
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Note that all points of equioscillation of the error curve f — r* must be in [5, 1]. Set 

II f — r*\\ A 

Since r* is degenerate, it is easy to see that r^r" G R m ,n- Consider now the functions 

^ := ^ min{ rwrTM } ' fk:=f ~ 9k - 

Evidently, g k = if x > (5. Furthermore, whenever < x < 5, we have by the choice of A 
.... ^4fc(<5 — x) AkS , ,k5\\f — r*\\ . ,..„ 

^ -»><*>! = ^(i + LkiIm) £ HTTh) 5 (1 " a) "rw 1 £ (1 " a)ll/ - r '■ 

Using the last inequality together with (6.6), we obtain for < x < 5 

\(fk ~ rl*)(x)\ = \(f- r*)(x) + (r fc - g k )(x)\ < a\\f - r*\\ + (1 - a)||/ - r*|| = ||/ - r*||. 

Recalling that — r£* = / — r* whenever x > (5 and all points of equioscillation of the error curve 
f — r* are in [5, 1], it follows that the error curve — r£* equioscillates at least m + n + 2 times, 
i.e., by Theorem 6.4, r%* is the best approximant to fk- On the other hand, we clearly have 

||/ fc -/|| = ||g fc ||< y^' 1 -^0, fc^oo, 

while for every k 

A 

II ** * II II II \ 
ll r fc - r II = \\ r k\\ > 



q*(0)' 

Thus the continuity of the operator of best rational approximation fails to hold at /. This verifies 
the needed statement in the case |(/ — r*)(0)| < ||/ — r*||. 

Finally, if |(/ — r*)(0)| = ||/ — r*||, we can introduce a small perturbation to the function 
/ resulting in f E G C[0, 1] such that ||/ — / £ || < e, r* is still the best approximant to f £ and 
\(fe — r *)(0)| < ||/e — r*||. By the above argument, the continuity of best rational approximation 
fails uniformly at f £ by at least A/q*(0), and hence it will fail at / as well. □ 

The ideas in this section have been further generalized to other non-linear families, cf., Barrar, 
Loeb [1970], Braess [1973], and Braess [1986]. 



Part II. Non-Classical Strong Uniqueness 



7. Uniformly Convex Space 

Let X be a uniformly convex space (of dimension at least 2) with norm || • || . In such a case, we know 
that we always have uniqueness of the best approximant. But we rarely have strong uniqueness in 
the classical sense. As such, we look for a different non-classical form of strong uniqueness. This 
we do in the present context using the modulus of convexity. The modulus of convexity on X is 
defined as follows. For e G (0,2], we set 

5(e) := inf{l - : ||/||, \\g\\ = 1, ||/ - g\\ > e}. 

We recall that X is uniformly convex if 5(e) > for every e > 0. If X is a uniformly convex space, 
then the following estimate is valid; see Bjornestal [1979]. 
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Theorem 7.1. Let M be a linear subspace of X. Given f € X assume that m* is the best 
approximant to f from M. Then for all m £ M, m 7^ m* , we have 



11/ - m \\ ~ 11/ - > 11/ - m\\ 5 



\m — m 



\\f-m\ 

Remark. Note that as ||m — m*|| < ||/ — m\\ + \\f — m*\\, it follows that 



\m-m*\\ < ||/ - mjj + ||/ - m*|| = 1 | ||/-m*|| < 2 



11/ -H 



11/ - m| 



||/-m|| 



is well-defined. 



since ||/ - m*\\ < \\f - m\\. Thus 5('- 
Remark. 5 is a non-decreasing function. If 

11/ - m \\ ~ 11/ - m *\\ < o-) 
then, since ||/ — m|| > ||/ — m*||, we obtain 

||/-m||-||/-m*|| > \\f-m*\\8 



\m — m 



||/ — m*\\ + a 



which is a non-classical strong uniqueness inequality. 

Proof: As M is a linear subspace, the above claim is equivalent to verifying 



\h — ml 



> \\h — mil 5 



m 



\h — m| 



for any h such that is a best approximant to h from M. 

It easily follows from the definition of S that for any f,g<EX satisfying 
we have 



f + 9 


< r 




2 







< r, r > 0, 



Let, in the above, f = h and g = h — m, and r = — m|| > 
obtain 



Substituting, we therefore 



2h — m 



< \\h — ml 



1-6 



m 



|/i — m| 



Note that as is a best approximant to h from M, it follows that 



2h — m 




m 










> 


2 









m 



Thus 

ll^ll < \\h — mil 1 — 51 -j 

\ || Ai — m| 

that immediately translates into the desired inequality. □ 
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For a similar result, see Wegmann [1975], and also Lin [1989]. From Theorem 7.1, we may ob- 
tain the following result that was proven in Smarzewski [1986], Prus, Smarzewski [1987], Smarzewski 
[1987] by different methods. 

Corollary 7.2. Let M be linear subspace of X. Assume 

5(e) > Ce q 

for e G (0, 2] and q > 1. Then for m* , the best approximant to f from M, we have 

||/ - m\\ q - ||/ - m*\\ q > C\\m - m*\\ q , 
for all m G M, m / m* . Furthermore, if 

||/-m||-||/-m*||<<7 

then we also have 

||/ - m\\ - ||/ - m*\\ > Ck\\m - m*\\ q , 
where k := ||/ - m*||/(||/ - m*\\ + a) q . 

Proof: The first inequality follows immediately from Theorem 7.1. If q = 1, there is nothing to 
prove. For q > 1, we have 

||/ - m\\ - ||/ - m*\\ > ||/ - m\\6 \- / > ||/ - m\\C 



11/ - m ll J \ 11/ - m l 

Multiply through by ||/ — m|| 9_1 and use the fact that ||/ — m*\\ < \\f — m\\. 

The second inequality follows from the second remark after the statement of Theorem 7.1. □ 

Remark. Moduli of convexity satisfying 

5(e) > Ce q 

are called moduli of convexity of power type q. It is known that for any modulus of convexity, 
we can only have the above holding for q > 2. (The condition q > 1 in the above corollary is 
superfluous.) 

It is also known, see Hanner [1956], that if 5 p (e) denotes the modulus of convexity of L p or £ p , 
then we have 

g(e) _ f^ + o(£ 2 ), 1<P<2, 

In addition, in Hilbert spaces, we always have 5(e) = 1 — (1 — e 2 /4) 1 / 2 = e 2 /8 + 0(e A ) and Hilbert 
spaces are the most convex in that 5(e) < 1 - (1 - e 2 /4) 1 / 2 holds for any uniformly convex Banach 
space. For L p , we in fact have 

5 P (e) > c p e q 

for all e € [0, 2] where q = max{2,p} and 

P -y, 1<P<2, 

^F, 2 < p < OO. 
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The following result can be found in Smarzewski [1987]. Assume is an increasing convex 
function defined on R + satisfying (f)(0) = 0. Assume in addition that 

5{e) > K(fj(e) 

for e G [0,2], and is submultiplicative in that there exists a constant L > such that for all 
t, s G R + , we have 

<f)(ts) < L<p{t)4>(s). 

Then the following holds. 

Proposition 7.3. Let M be a linear subspace of the uniformly convex space X. Assume is as 
above. If m* is the best approximant to f from M then 

0(||/ - m||) - 0(||/ - m*||) > KL-^iWm - m*||). 

for all m G M, m 7^ m* . 

Proof: Since <fi is convex and 0(0) = 0, we have for any A G [0, 1] that 

0(At) = <p(Xt + (1 - A)0) < A0(t). 

Thus 



0(||/ - m*||) = 
Therefore, applying Theorem 7.1, 



\\f~rn*\ 
11/ -"ill 



11/ < 



0(||/- m||) - 0(||/ -m*||) > 0(||/ -m 



Wf-m* 
||/ -m 

||/ -m* 



11/ - m 

11/ - HI - 11/ - m * 



HWf ~m\\). 



-0(||/ -m|| 



> 5 



11/ - m 
\m — m* || 

11/ - HI 
m — m 



0(||/ -m|| 



0(11/ -mil) 



0(11/ -m||) 



\\f-m 
> KL^^Wm-m*]]). □ 

Corollary 7.2 easily follows from Proposition 7.3 by choosing 0(t) = t q (where L = 1). A 
different approach is the following which may also be found in Smarzewski [1987]. 

Theorem 7.4. Assume there exists a positive constant K and an increasing nonnegative function 
defined on R + such that 



9 + h 



<n[<KM) + <MIN)1 -Wh -H) 



holds for all /, g G X. Let M be a linear subspace of X. If m* is the best approximant to f from 
M, then 

0(||/ - m||) - 0(||/ - m*||) > 2K0(||m - m*||) 
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for all m € M, m ^ m* . 

Proof: As m* is the best approximant to / from M, we have that 



53 



/ — m + / — m* 



f 



m + m 



>\\f 



m 



Set g := f — m* and h = f — m. Then, as 4> is increasing, 



<KH/ -m* || )<</» 



f — m + f — m* 



< -[<f>(\\f - m*||) + 0(||/ - m||)] - K<K||m - m*| 



The desired inequality follows. □ 

For p > 2, it is known that for g, h G L p , we have 

ll5 + ^ + ll5-^<2 p - 1 (||C + 



This inequality is called Clarkson's Inequality; see Clarkson [1936]. Thus, it follows that we can 
apply Theorem 7.4 with 4>(t) = t p and K = 2~ p giving 



|| / _ m ||p_|| / _ m *||p>2 1 -f| 



m — m 



This improves somewhat the constant 2 p /p obtained as a consequence of Corollary 7.2. For a 
better constant, see Smarzewski [1986]. For 1 < p < 2, the inequality 



g + h 



pj 



holds for all f,g £ X. This is called Meir's Inequality; see Meir [1984]. Thus, we can take (p(t) = t 2 
and iiT = p{p — l)/8 giving 



|| / -m||^-||/-ml^>^- 1) 



* Il2 

nm-m || p . 



This improves somewhat the constant (p — l)/8 obtained as a consequence of Corollary 7.2. 

For p = 2, both these improved constants are 1/2. However this is not the optimal constant, 
as in any Hilbert space, we have 



||/ -m|| 2 - ||/ -m 



*i|2 II *||2 

' = m — m \\ 



for all m 6 M assuming m* is a best approximant to / from the linear subspace M. 

For other results in this direction, see Angelos, Egger [1984] and Egger, Taylor [1989]. 
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8. The Uniform Norm Revisited 

In this section, we return to the uniform norm over an interval [a, b]. It transpires that if we 
restrict our approximation to only "smooth" functions, then a weaker condition than the Haar 
condition is both necessary and sufficient to insure uniqueness of the best approximant to each 
"smooth" function if the finite-dimensional subspace M is itself "smooth". The following result, 
from Garkavi [1959], see also Kroo [1984], characterizes unicity spaces with respect to C r [a,b] for 
r > 1. Note that this characterization is independent of r. The characterization critically uses the 
fact (independent of r > 1) that if x G Aj_ m * n (a, b) and / — m* G C r [a,b] then we must have 
(/ - m*Y(x) = 0. 

Theorem 8.1. Let r be a fixed positive integer, and let M be an n-dimensional subspace of 
C r [a,b]. A necessary and sufficient condition for the uniqueness of the best approximant from M 
to each f G C r [a, b] is that the following does not hold: 

There exist k distinct points {xi}^ =1 in [a, b], 1 < k < n + 1, nonzero values {o~i}^ =1 , and an 
in G M\{0} such that 

(a) Yli=i crimfai) = all m G M. 

(b) fh(xi) = 0, i = 1, . . . , k. 

(c) m'(xi) = if Xi G (a, b). 

In Section 4, we defined a Chebyshev (T-) system as a system of functions mi, . . . , m n defined 
on C[a, b] such that no nontrivial m G M = spanjmi , . . . , m n } vanishes at more than n — 1 distinct 
points of [a, b]. We often use the term T-system interchangeably for both m 1; . . . , m n and for the 
space M. We say that M = spanjmi, . . . , m n } is an £T 2 -system if M C C 1 [a, b] and no nontrivial 
m G M vanishes at more than n — 1 distinct points of [a,b], where we count multiplicities up to 
order 2, i.e., we count X as a double zero if m(x) = m'(x) = 0. From simple zero counting, we 
obtain the following result, that is essentially in Garkavi [1959]. 

Proposition 8.2. Let r be a fixed positive integer. Let M be an n-dimensional subspace of 
C r [a,b]. Assume M contains a T-system of dimension m and is contained in an ET 2 -system of 
dimension I, with m > £/2. Then to each f G C r [a, b], there exists a unique best approximant from 
M. 

Based on the above Theorem 8.1, it is natural to ask whether strong uniqueness always holds 
in this setting. We first give an example to show that classical strong uniqueness need not hold. 
We then show that a non-classical strong uniqueness with 4>{t) = t 2 holds in this case. 

Example. Let M = spanjl, x, x 3 } in C[— 1,1]. Since M contains a T-system of dimension 2 
(span{l,x}), and is contained in an i?T 2 -system of dimension 4 (spanjl, x, x 2 , x 3 }), it follows from 
Proposition 8.2 that M is necessarily a unicity space with respect to C^-l, 1]. Set f(x) := 2x 2 -l. 
As is easily checked, its best approximant from M is the zero function. Since Af = { — 1, 0, 1}, we 
have 

7(/) = min max{m(— 1), — m(0), m(l)}. 

771 6 M 
||m|| = l 

Choosing m(x) = x — x 3 G M, we see that ^y(f) = 0, and therefore classical strong uniqueness does 
not hold. 

Let us consider the above example in further detail. Set m a (x) := a(x — x 3 ). For a sufficiently 
small, we have 

11/ -mall > ll/ll +7lKf 
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for some 7 > if and only if p > 2. To see this, note that ||/|| = 1 and ||m a || = 2|a|/3\/3. Thus 
the right-hand side of the above is of the form 

(MY 

Set 

g(a) := \\f — m a \\ = max |(2x 2 — 1) — a(x — x 3 )\. 
xe[— 1,1] 

For \a\ sufficiently small, we have 

9(a) = I + ? v 'IT^ + ^(v'IT^-2). 

Expanding y/4 + 3a 2 in a Taylor series about a = 0, we obtain 

g(a) = 1 + -a 2 — ca 4 + ■ ■ ■ 



for some c > 0. Thus there exists a 7 > such that 

g{a) > I + 7 



(MY 

\3V3j 



for a sufficiently small if and only if p > 2. 



Let M C C r [a,b] be an n-dimensional unicity space with respect to C r [a,b]. The previous 
example showed that we cannot hope to obtain a classical strong uniqueness theorem. We shall, 
however, prove the following result due to Kroo [1983a]. 

Theorem 8.3. Let r be a positive integer, r > 2. Let M C C r [a, b] be a finite-dimensional unicity 
space with respect to C r [a,b], and let f G C r [a,b]. Given any positive constant a, there exists a 
positive constant 7, dependent on f, M and a (but independent of specific m G M) such that if 
m* is the best approximant to f from M, and ifmeM satisfies 

||/-m||-||/-ml<<7, (8.1) 

then 

||/ - m\\ - ||/ - m*\\ > 7 ||m - m*f. (8.2) 

Remark. Theorem 8.3 is a simpler case of a general result in Kroo [1983a]. This more general 
result also covers the case where M C C 1 [a, b] is an n-dimensional unicity space with respect to 
C^a, b]. The proof thereof is somewhat more cumbersome as the modulus of continuity of /' and 
m' enter both the statement of the result and the analysis, i.e., a non-classical strong uniqueness 
result is proven with a more complicated We choose to deal with the simpler case as presented 
here. 



Prior to proving this theorem, let us prove an ancillary result concerning local strong unique- 
ness. 
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Proposition 8.4. Let (f> be any nonnegative increasing function defined on [0, oo). Assume there 
exists a o\ > such that if m £ M satisfies 

11/ -m|| -||/- m*|| < <7i 

then 

11/ - m \\ ~ 11/ ~m*\\ > 7i<£(||m - m*||) 

for some 71 > 0. Then given any o 2 > 0, there exists a 72 > 0, dependent on 71, <Ji, 02, <?!> and 
||/ — m* ||, such that, for every m G M satisfying 

11/ - m ll - 11/ - TO 1 < cr 2, 

we have 

11/ - HI - 11/ - > 72<K||m - m*||). 
Proof: We need only consider the case where U2 > o\. For all m G M satisfying 

11/ - m \\ ~ 11/ -m*\\ < 02, 

we have 

|| m — m*\\ < \\f — m\\ + \\f — m*\\ < 2||/ — m*|| + cr 2 . 
Thus for all such m, we have 

<t>{\\m-m*\\)<<t>{2\\f -m*\\+a 2 ). 

Choose 7* > for which 

7*0(2||/ - m*|| + o 2 ) <<n 

and set 

72 = min{7i,7*}. 

Then for m £ M satisfying 

||/ -m|| -||/- m*|| < <7i, 

we have 

||/ - m|| - ||/ - m*|| > 7i0(||m - m*||) > 72^(||m - m*||). 
For m £ M satisfying 

<?i < 11/ - m|| - ||/ - m*|| < cr 2 , 

we have 

720(||m-m*||) < 7*<P(\\m-m*\\) <7*<P(2\\f -m*\\+a 2 ) 
< ti< ||/-m||-||/-m*||. □ 

Proof of Theorem 8.3: To prove the theorem, we show that if 

||/ -m|| -||/ -m*|| =5 2 (8.3) 
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then 

y/j \\m - m*\\ < 5. (8.4) 

We shall choose a sufficiently small 5 > depending on / and M. This proves the theorem for 
a = 5 2 . We then apply Proposition 8.4 to get the full result (with a different 7). 

Recall from Theorem 2.3 that the function m* G M is a best approximant to / from the 
n-dimensional subspace M if and only if there exist k distinct points xi,...,Xk G Af_ m *, for 
1 < k < n + 1, and strictly positive values Ai, . . . , A& such that 



^ Ai[sgn(/ - m*)(xi)]m(xi) = 



i=l 

for all m G M. 

Furthermore, from Theorem 8.1, the fact that M is a unicity space with respect to C r [a,b] 
implies that for the {xi}f =1 as above, if m G M satisfies m(xi) = 0, i = 1, . . . , k, and m'(xi) = 
if Xi G (a, 6), then m is necessarily identically zero. Thus the unicity space property says that the 
functional 1 1 1 • 1 1 1 defined on M by 

|||m||| = max |m(xi)| + max Im'^)! 

i=l,...,k i=i,...,k 

is a norm on M. Since M is finite-dimensional, this norm is equivalent to the uniform norm. Thus, 
there exists a constant c\ such that 

\\m — m*\\ < ci\\\m — m*\\\ 

for all m G M. Here and in what follows, the Cj will always denote positive constants that depend 
only upon / and M. It therefore suffices, in proving (8.4), to prove that, under conditions (8.3), 
we have 

|(m — m*)(xi)\ < c 2 S, i = l,...,k, (8.5) 

and 

\(m — m*)'(xi)\ < c^S, if Xi G (a, b), i = l,...,k. (8-6) 

We assume that (8.3) holds and will prove (8.5) and (8.6). We prove (8.5) and (8.6) in two 
distinct lemmas. In the first lemma, we prove more than what is formally needed to verify (8.5), 
but apply this better estimate in the second lemma to prove (8.6). 

Lemma 8.5. Under the above assumptions, (8.5) holds. In fact, \(m— m*)(xi)\ < c^5 2 , % = 1, . . . , k. 
Proof: We assume m G M. Since Xi G Af_ m *, we have 

11/ -m|| 2 > \(f - m )( Xl )\ 2 = \(f - m *)( Xi ) - ( m - m *)( Xl )\ 2 

= ||/ - m*|| 2 - 2||/ - m*\\ sgn[(/ - m*)(xi)](m - m*)(xi) + |(m - m*)(xi)\ 2 . 

From (8.3), we have 

11/ - m|| -II/- m*|| =5 2 , 

and therefore, assuming 5 < 1, 

11/ " HI 2 = (11/ ~m*\\ + 5 2 ) 2 < ||/ - m*|| 2 + c 5 5 2 . 
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Thus 

-2||/ - m*\\ sgn[(/ - m*)(xi)](m - m*)(xi) + |(m - m*)(xi)| 2 < c 5 <5 2 

implying that 

sgn[(/ - m*)(xi)](m - m*)(xj) > -c 6 5 2 , i = 1, . . . ,k. 

Recall that 

k 

^2 Aj[sgn(/ - m*)^)]™^) = 

i=l 

for all m G M, and normalize the A, so that 2Ji=i Aj = 1. The A, are independent of any specific 
m. Thus 

1 k 

-c 6 5 2 < sgn[(/ - m*)(xi)](m - m*)(x i ) = - — ^ A 3 - [sgn(/ - m* ) )] (m - m* ) (x 3 - ) 

1 3 = 1 

1 ^ 

= — ^A i (-[sgn(/-m*)(x J )](m-m*)(x J )) < c 7 5 2 . 

Ai 3 = 1 

We have shown that 

-c & 8 2 < [sgn(/ - m*)(xi)](m - m*)(xi) < C78 2 , 

implying that 

\{m — m*){xi)\ < c^5 2 , i = l,...,k. □ 

Lemma 8.6. Under the above assumptions, (8.6) holds, i.e., \(m — m*)' (xi)\ < C38, if Xi G (a, b), 
i = 1, . . . , k. 

Proof: Assume Xi G (a, 6). From Lemma 8.5, 

|(m - m*)(xj)| < c 4 5 2 , 

where 

\\f-m\\-\\f-m*\\ = 5 2 . 
Now for G (a, 6) and 5 > such that Xi ± <5 G [a, 6], 

[sgn(/ - m*)(xj)](m - m*)(x, ± <5) = [sgn(/ - m*)(xj)] ((/ - m*)(a; i ± 8) - (/ - m)(xj ± (5)) 

> [sgn(/ - m*)(xi)](f - m*)(xi ±8)- \\f - m\\ 
= [sgn(/ - m*)( Xi )](f - m*)( Xi ±8)- \\f - m*\\ - 8 2 
= [sgn(/ - m*)(xi)] ((/ - m*)( Xl ± 8) - (f - m*)( Xi )) - 8 2 . 

Set 

h(v) : = (/ - m *)(.Xi + y)~{f - m*)(xi). 

As Xi G ^4/_ m * n (a, 6), we have (/ - m*Y(x i ) = 0. Thus we see that h(0) = h'(0) = 0. Since 
h G C r , r > 2, in a neighborhood of the origin, it therefore follows that \h(y)\ < c 8 y 2 for some 
constant Cg. Thus 

[sgn(/ - m*)(xi)](m - m*)(x, ± 5) > -c 8 <5 2 . (8.7) 
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In addition, from Taylor's Theorem, 



[sgn(/ — m*)(xj)](m — m*)(xi ± 5) < [sgn(/ — m*)(xi)](m 



m*)(xi) 



±[sgn(/ - m*)(xi)](m - m*)'(xi) 5 + c 9 5 2 . 
Thus, as \{m — m*)(xi)\ < c±b 2 and using (8.7), we have 



±<5[sgn(/ - m*)(xi)](m - m*)'(xi) 

> [sgn(/ - m*)(xi)](m - m*)(xi ± 6) 

> [sgn(/ - m*)(xi)](m - m*)(xi ± S) 



[sgn(/ - m*)(xi)](m 
cwS 2 > -cud 2 . 



m*)(xi) - c 9 5 2 



We have proven that 



±5 [sgn(/ - m*)(xi)](m 



m*)'{ Xi ) > - Cll 5 2 



implying 



\(m - m*)'{xi)\ < c u S. □ 



Proof of Theorem 8.3 (cont'd): Lemmas 8.5 and 8.6 together prove Theorem 8.3. □ 

The example prior to Theorem 8.3 shows that, in general, we cannot expect to obtain a power 
less than 2 if M is not a Haar space. In fact, a result is proved in Kroo [1983a] which shows that 
this estimate is optimal for any such subspace. We state this here without proof. 

Theorem 8.7. Let r be a positive integer, r > 2. Let M C C r [a,b] be a n-dimensional unicity 
space with respect to C r [a,b], and assume M is not a Haar space, i.e., there exists an m £ M, 
m/0, with n distinct zeros in [a,b]. Then there exists an f G C°°[a,b] with best approximant 
m* G M, and a sequence of elements G M, rrik — > m* , such that 



In this section, we consider the space -L x [a, b] with measure fi. We assume in what follows that fi is 
a non-atomic positive finite measure on [a, b] with the property that every real-valued continuous 
function is /U-measurable, and such that if / G C[a, b] satisfies ||/||i = then / = 0, i.e., || • ||i 
is truly a norm on C[a,b]. We consider finite-dimensional subspaces M C C[a, b] that are unicity 
spaces with respect to all / G C[a, b] in the L x [a,b] norm, i.e., for which there exists a unique 
best approximant from M to each / G C[a, b] in the L l \a, b] norm. We restrict our consideration 
to approximation to continuous functions because if we consider approximation to all functions 
in L 1 ^, b] then there are no finite-dimensional unicity spaces. But there are many unicity spaces 
in this continuous setting, cf. Pinkus [1989]. One characterization of these unicity spaces is the 
following which may be found in Cheney, Wulbert [1969, Theorem 24]. 



11/ - m k \\ - [[/ - m*|| < 7||m fc - m 



for some constant 7 > that depends only on f and M and is independent of k. 
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Theorem 9.1. Let fj, be a non-atomic positive measure, as above. Let M be a finite-dimensional 
subspace of C[a, b] . Then M is a unicity space for all f G C[a, b] in the L 1 [a, b] norm if and only if 
there does not exist an h G L°° [a, b] and an rh G M, fh ^ 0, for which 

(i) \h(x)\ = 1, all x G [a,b], 

(ii) / hmdfi = 0, all m G M, 



(iii) /i|m| is continuous. 

Let (5) denote the standard modulus of continuity of / G C[a, 6], i.e., 

u; (/;<$) = max{|/(x) - /(y)| : x,y G [a, 6], |x - y\ < 5}. 

We recall that co(f; ■) is a continuous nonnegative non-decreasing function for 5 > with oj(f; 0) = 0. 
For any such continuous nonnegative non-decreasing function F, let F~ 1 (x) = min{y : F(y) = x} 
denote its inverse. Note that F~ x is nonnegative and increasing, but may have jumps, i.e., it need 
not be continuous. This next result from Kroo [1981a] provides a non-classical strong uniqueness 
result in the case where M is a finite-dimensional unicity space in C[a, b] in the L x {a, b] norm. It is 
well-known, for example, that Haar spaces satisfy this condition. For Haar spaces, this next result 
was obtained by Bjornestal [1975]. 

Theorem 9.2. Let M be a finite-dimensional unicity space in C[a,b] in the L 1 ^, b] norm. If 
f G C[a, b], m* is the best approximant to f from M, and o > is an arbitrary positive constant, 
then for any m G M satisfying \\f — m||i — ||/ — m*||i < a, we have 

[[/ - m\\i - ||/ - m*||i > j\\m - m*\\ 1 uj~ 1 (f; D\\m - m*||i) 

where 7, D > are constants that depend only on f, a and M. 

Proof: For convenience, we assume that m* = 0, i.e., replace/ — m* by/. Set e := ||/— m|| 1 — ||/||i. 
We shall prove that if e < 1 then 

e > 7||m||io; _1 (/; D||m||i) 

for some constants 7, D > that depend only on / and M. From Proposition 8.4, this suffices to 
prove the theorem. 

As G Pm(/), there exists, by Theorem 5.5, an h G L°°[a,b] satisfying 

(i) \h(x)\ = 1, all x G [a, b], 

(ii) I hmdfj, = 0, all m G M, 

J a 



(iii) / 

J a 



b 

hfd/j, 



From (i) and (iii), it follows that h{x) = sgn/(x) [i-a.e. on N(f). 
For each x G [a, b], we define the ^-neighborhood of x as 



O s := (x - S, x + S) n [a, b]. 
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Given the above h G L°°[a, 6], let S(f) denote the set of x G [a, 6] for which 

fi(O s {x)n{x : h(x) = 1}) > 

and 

fi(O s (x) n {x : h(x) = -1}) > 

for all 5 > 0. That is, S*(/) is the set of essential sign changes of h. It is easily verified that if S(f) 
is not empty then it is closed and hence compact. Thus for each fixed 5 > 0, we have 

mm inf n(O s (x) n {x : h(x) = a}) = [3 f (6) > 0. (9.1) 

We have yet to show that S(f) / 0. This follows immediately from Theorem 9.1. In fact, we 
have that 

|||m||| := max 
*eS(/)' 

is a norm on M. We prove this as follows. By the equivalence of norms on finite-dimensional 
subspaces of C [a, b] it suffices to prove that only the zero function in M vanishes identically on 
S(f). Assume to the contrary that there exists an fh G M, fh ^ 0, that vanishes on S(f). We 
claim that we may choose h as above so that h\fh\ G C[a, b\. To see this, assume fh does not vanish 
on an interval (c,d). As (c,d) n S(f) = for each x £ (c,d), there exists a 6 X > such that 
h = 1 or h = — 1 /x-a.e. on (^^^(x). From the connectedness of (c,d), this implies that we may take 
h = 1 or h = — 1 on all of (c, d). Thus /i|m| is continuous on (c, d), and hence it easily follows that 
h\fh\ £ C[a,b]. But, by Theorem 9.1, this contradicts the fact that M is a unicity space for all 
/ G C[a,b] in the L 1 [a, 6] norm. No such fh exists and therefore ||| • ||| is a norm on M. 

As HI • HI is a norm on M, it therefore follows that there exists a constant c\ > 0, that depends 
only on M, satisfying 

|||m||| > ciUmlloo 

for all m G M. Here and in what follows, the cj will always denote positive constants that depend 
only upon / and M. 

Returning to the proof of the theorem, we have 

e = Wf-mW.-WfW, 

b r-b 

(/ - m) sgn(/ - m) dfi - / /sgn/d^ 

J a 



= / (/ — m) sgn(/ — m) dfi + / (— m) sgn(— m) dfi — / /sgn/d^ 
JN(f) Jz(f) JN(f) 

= / (/ — m) [sgn(/ — m) — sgn /] dfi — / msgn/d/i+ / |m|d/x. 
JN(f) JN(f) ' Jz(f) 



'N{f) JN(f) JZ{f) 

From (ii), we have 



/ 



b 

hm dfj, = 



for all m G M, while h = sgn / on N(f). Thus 

e = I (f — m) [sgn(/ — m) — sgn /] dfi + / mhdfi+ / |m| d/j, 

JN(f) ' ' JZ(f) ' JZ(f) 

= 2 / \f — m\dfi+ \m\ + mhdfi. 

J{x:f(f-m)<0} ' Jz(f) 



(9.2) 
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For m G M, as above, let x* G S(f) satisfy 

|m(x*)| = |||m||| > ciUmlloo 
and for convenience set a* := sgnm(x*). Thus, there exists a 5 > such that for all x G Og(x*), 



and thus by (9.1) 

Let 

and 

By (9.4), we have 



a*m(x) = \m(x)\ > ^||wi||oo (9-3) 



H{0 5 {x*) n {x : h(x) = a*}) > f3 f (5) = c 2 > 0. (9.4) 
B := 5 {x*) n {x : x G N(f), sgn/(x) = a*} 
Q ■= O s (x*) n {x : x G Z(/), /i(x) = a*}. 
+ > c 2 . 



We now consider three options. 

I. Assume n(Q) > C2/2. Since Q C Z(/), we have from (9.2) that 

s > \m\+mhdfi> / \m\+mhdfi = 2 / |m| d/x 

iz(/) Jq Jq (9.5) 

> Ci||m|| 00/ u(Q) > c 3 1 1 m 1 1 1 

due to the equivalence of the || • ||i and || • || oo norms on M. Now if m G M satisfies ||/— m||i — ||/||i < 1 
then 

IHIi < ||/-m||i + ||/|| 1 <2||/|| 1 + l, 

i.e., all such m are uniformly bounded in norm, and therefore u~ l {f; ||m||i) is also uniformly 
bounded above for all such m. Thus (9.5) implies the desired result. 

II. Assume n{B) > c 2 /2 and a*f(x) < (ci/4)||m|| 00 for all x G 0$(x). From (9.3) and our 
assumption, we have that 

B C {x : f(x)(f-m)(x) < 0} 

and therefore from (9.2) and (9.3) 

e > 2 / \f-m\dn > 2 / \f-m\dfi 

J{x:f{f-m)<0} Jb 

> 2 J y IMloo - ^-Umlloo dfi = yllmHoo/^-B) > c 4 ||m||i 
which, as above, implies the result. 

III. Assume n{B) > c 2 /2 and there exists an x G 0§(x*) for which a*f(x) > (01/4)111711(00. 
Since x* G Z(f), we have f(x*) = 0. Assume, without loss of generality, that x > x* . Choose 
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x* < x\ < x 2 < x so that f{xi) = 0, cr*/(x 2 ) = (ci/^HmHoo and < cr*f(x) < (ci/^HmHoo on 
(xi,x 2 ). As /(x 2 ) - /(xi) = ct^ci/^IHIoo, we have 

x 2 - xi > uj 1 ^Umlloo^ . (9.6) 

Since (xi,x 2 ) C (^(x*) from (9.3), we see that 

/(x)(/-m)(x) < 
for x G (xi,x 2 ). Thus from (9.2) and (9.6) 

£ > 2 / |/-m|d/i > 2 / -^Umlloo - — Umllood/x 

Cl Cl -1 / C l II \ I, -1 /T-.II 

= y ||"i||oo(x 2 - xi) > —\\mWooUjf y— UmllooJ > -/WmWxLjf (£)||m||i). 
This proves the theorem. □ 

We now turn to the one-sided L 1 -approximation problem. For this problem, unicity spaces 
are rare when considering approximating all / G C[a, b] from some finite-dimensional subspacc 
M in C[a, &]; see Pinkus [1989]. The situation is different if we consider the case where both 
the approximating subspace M and the functions to be approximated are in C 1 [a,6], i.e., are 
continuously differentiable. In this case, there are many unicity spaces. 

For convenience, we again consider the interval [a, b\. The more general case can be found in 
Pinkus [1989]. For each / G C 1 ^, 6], we define Z\{f) to be the set of zeros of / in [a, b] that only 
includes interior zeros if the function and its derivative vanish, i.e., x G (a, b) is in Zi(f) only if 
f{x) = f( x ) = 0. Note that if / € C l [a,b] satisfies / > 0, then Z x (f) = Z{f). In what follows, 
we also always assume that M contains a strictly positive function. With these assumptions, we 
have the following characterization of unicity subspaces that is from Pinkus, Strauss [1987]; see 
also Strauss [1982] and Pinkus [1989]. 

Theorem 9.3. An n-dimensional subspace M of C [a, b] is a unicity space for C [a, b] in the one- 
sided L x -norm problem if and only if there does not exist an m* G M, m* ^ 0, points {xi}^ =1 in 
Zi(m*), 1 < k < n, and positive values {Aj}^ =1 satisfying 



md/i = A^m(a 

i=i 



for all m G M. 



Let mi, . . . , m n be any basis for the n-dimensional subspace M. In what follows, we assume 
M C C x [a,b]. Given an / G C^a, b], we set 

n 

H f {5) := W (/'ii) + j; w K;i). 

i=l 



With these assumptions and definitions, we can now state the next result which is in Kroo, Sommer, 
Strauss [1989]. Recall that M(f) := {m : m G M,m < /}. 



A. Kroo and A. Pinkus 



64 



Theorem 9.4. Assume M C C l [a,b] is a finite-dimensional unicity subspace for C [a,b] in the 
one-sided ^-norm problem. Assume f £ C [a,b], m* is the best approximant to f from M{f), 
and a > is an arbitrary positive constant. If m £ M(f) is such that \\f — m\\i — \\f — m*\\i < a, 
then we have 

11/ -m||i -||/ -m*||i > 7 ||m-m*||iiJ7 1 ( J D||m-m*||i) 
for some constants 7, D > that depend only on f, a and M. 

If, for example, / and the elements of M are such that their derivatives are all in Lip a, 
< a < 1, then Hf(5) = 0(5 a ), and from the above, we obtain 

II J! II II J! *ll \ II *ll( a + l)/ a 

||/ — m||i — ||/ — m ||i > 7||m — m \\\ 

for some other constant 7 > that depends only on / and M. 

Proof: For convenience, we assume that m* = 0, i.e., replace / — m* by /. Set 



e ■= ||/ _ m || 



i- 



and assume < e < 1, as in the proof of Theorem 9.2. Since the zero function is a best one-sided 
/^-approximation from the n-dimensional subspace M, we have that / > 0, and from Theorem 5.7, 
there exist distinct points {xi}^ =1 in Z 1 (f), 1 < k < n, and positive numbers {Ai}f =1 for which 



J a 



b k 
i=l 



for all m £ M. 
Now 



f*b f*b rb n 

£ = ||/-m||i - (I/Hi = / (f-m)du.- fd/j,= - m dfi= -^2\im(xi). 

J a J a J a 

As f(xi) = and / — m > 0, we have m{xi) < and thus 

k 

e = ^\i\m(xi)\. 

i=i 

Recall that our aim is to prove that there exist constants 7, D > that depend only on / and M 
such that 

k 

e = ^ Xi\m(xi)\ > 7||m||i^ 1 (D||m||i). 

i=l 

As M is a unicity space for C 1 [a, b] in the one-sided L^norm problem, we have from Theorem 
9.3 that there does not exist an rh G M, rh / 0, points {yi}i =1 in Z 1 (fh), 1 < k < n, and positive 
values {/ii}f =1 satisfying 

r b k 

mdfi = ^^m(yi) 

i=l 



f 

J a 
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for all m G M. Thus, for the above {xi}^ =1 , if m G M satisfies m(xi) = 0, i = 1, ... , fe, and 
m'(xi) = 0, Xj G (a, 6), then necessarily m = 0. This implies that 

fc 

Hl m lll := X/ l m ( Xi )l + X/ \ m '( X i)\ 

i=l Xi£(a,b) 

is a norm on M and therefore 

< cilll 771 -!!! (9-7) 

for all m G M. As usual, the Cj will always denote positive constants that depend only upon / and 
M. 

As e = Y17=i ^i\ m { x i)\ where > for all i, we have 

\m{xi)\ < c 2 e, i = l,...,k. (9.8) 

To use the norm |||m|||, we must also bound the |m'(xi)| for Xi G (a, b). To this end, assume 
m = Yh=i b i m i- Tnen 

n n 

u)(m';5) < > |fej| w(mj; <5) < ( max cj(m^;5). 

^ — J 7 = 1 n ^— ' 

i=l i=l 

As mi, . . . , m n is a basis for M, it follows that max J= i v .. ]n \bj\ is also a norm on M. Therefore 
maxj^...^ \bj\ < C3 1 1 m 1 1 1 . Furthermore, Yl7=i w ( m i! — ^/(^)- Thus 

w(m';5) <C3||m||ifl/(5). (9.9) 

For m satisfying ||/ — m||i — ||/||i < 1, it follows that 

||m||i<||/-m|| 1 + ||/||i<2||/|| 1 + l. 

Thus ||m||i is uniformly bounded above and from (9.9), we have 

u(m';S) < c 4 H f (S). (9.10) 

Consider Xi G (a, b). Choose 1 > n > such that the (xi — n,Xi + 77) are disjoint intervals of 
(a, 6). For each m G M, < h < n and 6 = sgnm'(xj), we have 

m(xj + bh) = m(xj) + bhm'(xi) + E = m(xi) + h\m'(xi)\ + £\ (9.11) 

An easy estimate and (9.10) shows that 

\E\ < hu(m';h) < c A hH f {h). (9.12) 

Since f(xi) = f'(xi) = 0, we therefore have 

\f( Xi + bh)\ = \f(xi + bh) - f( Xi )\ < hoj(f; h) < hH f (h). (9.13) 
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From m( Xi + bh) < f(x t + bh), we have by (9.7), (9.11), (9.12) and (9.13) 

m(xi + bh) — m(xi) — E 



\m'{xi)\ 



h 

< \ ( f(xi + bh) + c 2 e - E) 

< - (hH f (h) + c 2 e + c 4 hH f (h)) 



(9.14) 



(9.15) 



h 

= c 5 (H f (h) + ^). 

Using the equivalence of norms, from (9.7), and (9.14), we finally obtain 

/ k 

\\m\\i < ci ^ \m(xi)\ + ^2 \m'(xi)\ 

\i=l Xi€(a,b) 

< ci [kc 2 e + kc 5 (#/(/*) + £)) 
<c,(H f {h)+ £ -^ 

for all h € (0,7?). 

Let us recall that we wish to bound e = ||/ — m||i — ||/||i from below. To this end, we consider 
two cases. 

Case 1: ||m||i < 2cqe/t]. In this case, it follows from the monotonicity of Hf that 

HjWWU) < Hj\2c 6 e/ V ) < c 7 

and therefore 

||m||ii/'^ 1 (||m||i) < 2c 6 c 7 e/r], 

i.e., 

7|H| 1 #~ 1 (|H| 1 ) < e. 
Case 2: ||m||i > 2c 6 e/ry. Set h := 2c 6 e/||m|| 1 < rj in (9.15) to obtain 

n us ( u ( 2cfs£ "\ , IHk 
IHIi < c 6 [H f - — — + 



m i / 2c, 



6 



Thus 



implying 



and therefore 



whence 



| || ^ INK , TT ( 2C & £ 

\m\\i < — h c 6 H f 



c 8 ||m||i < H f 



\m\\i 



m\\i 
2c 6 s 



H f 1 (c 8 ||m||i) < 

||m||i 

Mm II i 



2c 6 



-H f 1 (c 8 ||m||i) < e. 



This proves the theorem. □ 
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A converse result is proved in Kroo, Sommer, Strauss [1989] that shows that this estimate is 
optimal. We state it here without proof. 

Theorem 9.5. Assume M C C 1 [a, b] is an n-dimensional unicity subspace for C 1 [a, b] in the one- 
sided L l -norm problem. Then there exists an f £ C 1 [a, b] with m* the best approximant to f from 
M(f), and a sequence of elements £ M(f), rrik — > m* , such that 

11/ - mjfeHi - ||/ - m*\\i < 7||m fc - m*\\ 1 Hj 1 (D\\m k - m*||i) 

for some constants 7, D > that depend only on f and M and are independent of k. 

10. Strong Uniqueness in Complex Approximation in the Uniform Norm 

In this section, we consider the question of strong uniqueness in the space of continuous complex- 
valued functions endowed with the uniform norm. The principal result here will indicate that 
classical strong uniqueness fails, in general, in the complex setting in a Haar space. Instead, we 
shall derive a non-classical strong uniqueness type result of order 2, i.e., prove that for m* the best 
approximant to / from M, and any m £ M sufficiently close to m* , we have 

||/ — m\\ — ||/ — m*|| > 7||m — 777*|| 2 , 

where 7 > depends only on / and M. To this end, we shall need to extend some results from 
Section 2 to the complex setting. Throughout this section, C{K) will denote the space of complex- 
valued functions, continuous on the compact Hausdorff space K. First, we shall require a formula 
for the complex r- functional that is similar to the one provided by Theorem 2.1. We recall that 
Af = {x : |/(x)| = ll/ll}, and if z = x + ly where x,y £ R, then we set $lz = x and Qz = y. 

Theorem 10.1. For any functions f,g € C(K), f 7^ 0, we have 

1 



T+(f,g) = 77711 mapcK(/(a;)^(x)). 
ll/ll x€A f 

Proof: Note that for any zi,z 2 G C and t G R, we have 

\ Zl+ tz 2 \ 2 = \zx\ 2 + 2t$tzTz 2 + t 2 \z 2 \ 2 . (10.1) 
Using this relation, we obtain for every x £ Af, f / 0, 



(f)>v \f(x) + tg(x)\ 2 -\f(xr _ W(x)9W) 
+ V> 9 > - t ^o+ t(\f(x) + tg(x)\ + \f(x)\) |/(x)| 

i.e., 

r+(f,g) > tjtit max.K(f(x)g(x)). 

x€A f 



This proves the desired lower bound for r + (/,</). 

In order to verify the upper bound, choose t n — > + and let x n £ K be such that ||/ + t n g\\ is 
attained at x n . Similar to the argument used in the proof of Theorem 2.1, we can assume, without 
loss of generality, that x n — > x* £ Af. Then using the fact that \ f{x n )\ 2 < |/(x*)| 2 , we obtain 

T+{Lg) = Um \f(x n ) + t n g(x n )\i-\f(x*)\ 2 Um \f(x n )\ 2 -\f(x*)\ 2 



tn(\f(x n ) + t n g( Xn )\ + \f(x*)\) n^oo t n (\f(x n ) + t n9 {x n )\ + \f{x*)\) 

2M(f(xZ)g(x n )) &(f(^)g(x*)) 
^\f(x n )+t n g(x n )\ + \f(x*)\ ~ \f(x*)\ ■ 
(Note that since the limit on the left hand side and the second limit on the right hand side exist, 
the first limit on the right hand side must exist, as well.) Clearly this provides the needed upper 
bound. □ 
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Applying the above and the argument of Theorem 1.4, we obtain 

Corollary 10.2. Let f G C{K) and M be a linear subspace of C(K). Then m* £ M is a best 
approximant to f from M if and only if 



max — m*)(x)m(x)) > 0, m G M. 

Moreover, m* is the strongly unique best approximant to f from M if and only if 



inf max — m*)(x)m(x)) > 0. 

meM,\\m\\ = l xeA f _ m * 

The next theorem provides an analogue of Theorem 2.3 in the complex setting. Since this is a 
key result in this section, we include a proof. 

Theorem 10.3. Let f G C(K) and M be an n-dimensional subspace of C(K). Then m* G M is 
a best approximant to f from M if and only if there exist points xi, . . . , Xk £ Af- m * , and positive 
numbers Ai, . . . , 1 < k < 2n + 1, such that for every m £ M, we have 

fe 

J] A.a-m^^M^) = 0. (10.2) 

3 = 1 

Proof: (<^=) We may assume that 5^J=i = 1 an d m * = 0- Then using (10.2), we have for every 
m G M 

k k 

\\f\\ 2 = E ^lW)f{^) = E A i7(^)(/ - m )(^) ^ ii/ii • ii/ - m i 

3 = 1 3 = 1 

Dividing both sides of the above inequality by ||/|| yields that m* = is a best approximant to / 
from M. 

{=>) Assume again that m* = 0. Let M = span{mi, . . . , m n }. For any x G Af, consider the vectors 
u x := (»(7^mi(x)),-9(7^mi(x)),...,»(7^m n (x)),-9(7(i)m n (x))) G M 2n . 

Furthermore, let D := {ua;,x G Af} C M 2n . We claim that lies in the convex hull of D. Indeed, 
if this is not the case, then by the separating hyperplane theorem, there exists a c = (ci, . . . , C2 n ) G 
M. 2n such that (c, u x ) < for every x G Af. This clearly yields that for some in G M, we have 
K(/(x)m(x)) < for every x £ Af. But in view of Corollary 10.2, this contradicts the assumption 
that is a best approximant to / from M. Thus belongs to the convex hull of D C R 2n and, 
by the Caratheodory Theorem, is a convex linear combination of at most 2n + 1 points of D. 
Evidently, this verifies relations (10.2). □ 

We are now in a position to prove a strong uniqueness type result for complex Chebyshev 
approximation. This is the main result of this section and can be found in Newman, Shapiro 
[1963]. The proof as presented here is from Smarzewski [1989]. As previously, we shall call the 
n-dimensional subspace M of C{K) a Haar space if every nontrivial element of M has at most n 
distinct zeros in K. 
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Theorem 10.4. Let M be a finite-dimensional Haar space in C(K). Let f G C(K) and m* be 
the best approximant to f from M. Then there exists a c > that depends only upon f and M 
such that 

||/ - mf - ||/ - m*f > c\\m - m*\\ 2 , (10.3) 
for all m G M. In addition, ifmeM satisfies 

11/ - m \\ ~ 11/ - m *\\ < ° 

then 

||/ - m\\ - ||/ - m*\\ > j\\m - m*f, (10.4) 

where 7 := c/ (2||/ — m* \\ + <r) > and thus also depends only upon f, M and a. 

Proof: Inequality (10.4) is an immediate consequence of (10.3) since if ||/ — m|| — ||/ — m*|| < a, 
then 

c\\m-m*\\ 2 < \\f- m \\ 2 -\\f-m*f = (\\f - m\\ + ||/ - m* \\)(\\f - m\\ - \\f - m* ||) 
< (2||/_ m *|| +<7 )(||/_ m ||_||/_ m *||). 

It remains to prove (10.3). Assume M is a Haar space of dimension n. We recall from Theorem 
10.3 that 

k 

Aj(/ - m*)(x J )m(x i ) = 

i=i 

for all m G M, where xi, . . . , G Af_ m * , Ai, . . . , > and, since M is a Haar space, we have 
n+l</c<2n+l and, more importantly, 

k \ 1/2 

^Ajlm^j)! 2 ) == IIHII 

is a norm on M. Hence there exists a c > such that 

|||m||| 2 > c\\m\\ 2 (10.5) 

for all m G M. 

For each m G M and j G {1, . . . , k}, we have 

11/ -m|| 2 > |(/-m)(x,)| 2 = |(/-m*)(x J ) + (m-m*)(x J )| 2 

= ||/ - m*|| 2 + 2K ((f - m*)(xj)(m - m*)(x 5 -)) + l( m ~ m *)(^')| 2 - 

Multiply the above by Xj > and sum over j. Assuming, without loss of generality, that X)jLi \ = 
1, we obtain 

||/-m|| 2 > ||/-m*|| 2 + 2K f^A J (/-m*)(x J )(m-m*)(x J ) 
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k 

+ ^A J |(m-m*)(x J )| 2 . 

j=i 

Applying (10.2) and (10.5), this gives 

||/ — m|| 2 — ||/ — m*\\ 2 > c\\m — m*\\ 2 . □ 

Clearly, estimations (10.3) and (10.4) provide strong uniqueness type results that are weaker 
than the classical strong uniqueness result that holds in the real case when approximating by 
elements of Haar spaces. This raises the natural question of whether classical strong uniqueness 
can also hold for every complex function. Our next proposition, see Gutknecht [1978] for a particular 
example thereof, shows that this is not the case. In fact, it turns out that, in general, (10.4) provides 
the best possible estimate. 

Proposition 10.5. Let M be any subspace in C(K) containing the constant function. Assume 
that there exists an f £ C(K) such that 9/ = on K and m* = is the best approximant to f 
from M. (Such an / will clearly exist, for instance, when M possesses a basis consisting of real 
functions.) Then for m := ib G M, b G R, we have the following converse to (10.4): 

II * 119 

777 — 777 

||/- m ||-||/- m *||< " 2||/|| " . (10.6) 

Proof: Indeed, by the above assumptions, ||/ — m\\ 2 = ||/|| 2 + ||m|| 2 and since m* = 0, the above 
inequality easily follows. □ 

From (10.6), it follows that there cannot exist a 7 > such that 

||/ — m\\ — ||/ — 777*|| > 7||m — 77l*|| 

for all m G M. 

Despite the fact that classical strong uniqueness fails, in general, in the complex case, it is 
possible to give a relatively simple sufficient condition for it to hold. Let / G C(K) and assume 
that 777* is its best approximant. We shall say that the set of points A C ^4/- m * is extremal if m* 
is a best approximant to / from M also on the set A. Moreover, A is a minimal extremal set if no 
proper subset of A is extremal. Clearly, any set of points x\, . . . , G Af_ m * , k G {1, . . . , 2n + 1}, 
satisfying conditions (10.2) of Theorem 10.3 must be extremal. Hence the cardinality of any minimal 
extremal set can be at most 2re + 1. This also follows easily from Corollary 10.2. 

We now give a sufficient condition for classical strong uniqueness to hold in the complex case. 
This sufficient condition is to be found in Theorem 2 in Gutknecht [1978]. 

Theorem 10.6. Let / G C(K) and M be an n-dimensional subspace of C(K). Assume that m* 
is a best approximant to f from M possessing a minimal extremal set of cardinality 2n + 1 . Then 
777* is a strongly unique best approximant of f, i.e., for all m G M, we have 

||/ — 777|| — ||/ — 777*|| > 7||777 — 777*||, 

for some 7 > 0. 

Proof: Let {x\, . . . ,X2 n +i} C Af_ m * be a minimal extremal set. Then by Theorem 10.3, there 
exist corresponding positive numbers Ai, . . . , Mn+i such that (10.2) holds with k = 2n + 1. Assume 
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that to the contrary m* is not a strongly unique best approximant to / from M. Then by Corollary 
10.2, there exists an m G M, m-o / 0, such that 



max — m*)(x)m (x)) = 0. 

x€Af_ m * 

Since relations (10.2) hold for this m-o, we obtain that 



X((f-m*){x j )m o (x j )) = 0, j = l,...,2n + l. (10.7) 

Let {mi, . . . , m n } be a basis for M. For each j G {1, . . . , 2n + 1}, consider the vectors 

uj := («((/- m'X^Kfo)), -3f((/ - m*)(x J )m l (x J )), i = 1, . . . ,n) € K 2n , 

and set £ := {u.,} 2 "! 1 C M 2n . Then relations (10.7) yield that the set B lies in a (2n - 1)- 
dimensional hyperplane H of M. 2n . Moreover, by the standard argument repeatedly used above, 
belongs to the convex hull of B (otherwise relations (10.2) would fail for some m G M). Now, 
using the fact that the convex hull of B is of dimension In — 1, it follows from the Caratheodory 
Theorem that is a convex linear combination of at most In points from B. But this in turn 
means that (10.2) holds for a proper subset of {xi, . . . ,X2 n +i} (with some A's). Clearly this set of 
at most 2n points is extremal too, contradicting the condition of minimality of the extremal set 
{xi,...,x 2n +i}- □ 

Thus if m* is a best approximant with minimal extremal set consisting of exactly 2n+ 1 points 
then the best approximation is strongly unique in the classical sense. This result is not vacuous. 
It is easy to construct examples that satisfy the criteria of Theorem 10.6. Our next example, see 
Gutknecht [1978], shows that we do not necessarily have classical strong uniqueness for a minimal 
extremal set consisting of fewer than 2n + 1 points. 

Example. Let K := {\z\ = 1 
degree at most n — 1, and 



Then it is easy to see that ||/|| = 1 and Af = {z^ := e ink / n : k = 1, . . . ,2n}, i.e., At is the set of all 
roots of unity of order 2n. In particular, f(zk) = (— l) fc , k = 1, . . . , 2n. We claim that m* = is 
the best approximant of /. Indeed, if for some m G M, we had ||/ — m|| < ||/|| = 1 then evidently 

(-l) fc Km(z fc ) > 0, fc = l,...,2n, (10.8) 

where m = Y^jZo djZ-*, dj = aj + ibj. Now 3?m(e 1 *) = Y^Zo( a j cos sinjt), i.e., T n := Smfe 1 ') 

is a trigonometric polynomial of degree at most n — 1. By (10.8), T n has at least 2n — 1 distinct 
zeros on the unit circle. Thus, we must have T n = 0, and that contradicts (10.8). Hence m* = is 
a best approximant to / from M. It is also the unique best approximant since M is a Haar space. 
Note that by Corollary 10.2 this best approximant is not strongly unique in the classical sense since 
choosing m to be an imaginary constant, we have 



max 3?((/ — m*)(x)m(x)) = max 9(/(x)) = 0. 



: z G C) be the unit circle, M := II n _i the space of polynomials of 
/(*) ■■- 



z n + z 3n 
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It remains to show that the extremal set Af = {zj~ := e 17rfc / n : k = 1,... , 2n} is minimal. Let us 
delete any point of Af, say zi, and assume that the remaining set {zk '■= e 17rfc / n : k = 2, . . . , 2n} 
is still extremal. (From symmetry considerations, deleting z\ does not restrict the generality.) 
Clearly, we can choose a trigonometric polynomial T* = Rmfe' 1 ), m € H n -i, of degree n — 1 such 
that (—l) k T*(kir/n) < 0, k = 2, . . . , 2n. But in view of Corollary 10.2 this means that m* = is 
no longer a best approximant on this set of 2n — 1 points. Thus the extremal set Af consisting of 
2n points is minimal, but the best approximant is not strongly unique in the classical sense. 

Hence the condition of Theorem 10.6 is sharp, in a certain sense. Nevertheless, as the next 
example verifies, this condition is not necessary for classical strong uniqueness to hold. 

Example. Let K := {\z\ = 1 : z € C} be the unit circle, M := IT n _i be the space of polynomials 
of degree at most n—1, and f(z) := z n . Then Af = K and as in the previous example the function 
m* = is the unique best approximant to z n from M with the set {zt '■= e 17Tk / n : k = 1, . . . , 2n} 
being a minimal extremal set of 2n points. Let us show that this best approximant is strongly 
unique in the classical sense. Indeed, if classical strong uniqueness is not valid then by Corollary 
10.2 for some nontrivial m = ^j=o djZ-i , dj = aj + ibj, we would have 

max $l(z n m(z)) = 0. 
kl=i 

On the other hand, evidently T n := %t(z n m(z)) = Y^=o a j cos(n—j)t+bj sin(n—j)t is a nonpositive 
trigonometric polynomial of degree n such that 

\ T n (t)dt = 0. 
Jo 

Thus T n must be identically zero implying that m must be identically zero. By this contradiction, 
it follows that the best approximation is strongly unique in the classical sense. In fact, the optimal 
strong uniqueness constant in this case is l/n; see Rivlin [1984a]. 

An exact necessary and sufficient condition for when one has strong uniqueness in the classical 
sense is to found in Blatt [1984], who credits the result to Brosowski [1983]. These conditions are 
somewhat technical and are not detailed here. 

Part III. Applications of Strong Uniqueness 

We shall briefly consider various applications of strong uniqueness type results. The main idea 
behind these applications is the following: instead of solving the best approximation problem in a 
given norm, we replace it by considering another norm, close to the original norm, one that leads 
to a simpler approximation problem. Strong uniqueness is then applied in order to show that the 
best approximant in this new norm is sufficiently close to the original best approximant. Typically, 
the original norm is modified by replacing it by a similar discrete norm, or by introducing a weight 
function into the norm. We first start with some general remarks concerning approximation in 
nearby norms, and then proceed to a discussion of specific examples. 

11. Strong Uniqueness and Approximation in Nearby Norms 

Let X be a normed linear space with norm || • ||, and let M a finite-dimensional subspace of 
X. Assume that the sequence of seminorms || • ||fc, k G N, approximate the given norm, i.e., 
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liirifc^oo \\f\\k = ll/H, for all / el We also assume that for a given / el, its best approximant 
Pm(J) G ^ with respect to the norm || • || is unique, and denote by Pffiif) £ ^ the set of best 
approximants to / in the seminorms || ■ ||jt, k G N. We are interested in estimating the deviation of 
Pj^\f) from Pm(/)- This approach was first considered by Kripke [1964] and Peetre [1970]. 

To estimate the deviation of Pj£\f) from Pm(/), we introduce a quantity measuring the 
deviation of || • \\k from || • || uniformly on the set S(M) := {m G M : ||m|| = 1}, the unit sphere in 
M, namely 

r]k(M) := sup | ||m|| fc - 1|. 

meS(M) 

Lemma 11.1. For any finite-dimensional subspace M C X, we have 

lim n k (M) = 0. 

fe— >oo 

Proof: Assume to the contrary that for a subsequence ki, there exist elements Ui G S(M) such 
that 

I Kk - i| > 5 > 0, ieN. (li.i) 

Let Uj = S" =1 a*mj, where mi, . . . , m n is a basis for M. Since S(M) C X is compact, we may 
assume, without loss of generality, that converges to u* as i — > oo for some -u* = Y7- =l a*rrij G 
S(M), i.e., ||n*|| = 1. Moreover, by the equivalence of norms in finite-dimensional spaces, 



In addition, 



Thus 



lim max [a* — a*\ = 0. 

i— >oo l<j<n J J 



i— >oo ■ i — » ■ i — » 



\u*-Ui\\ kt < max |a* - a*| ^ ||mj|| fci , (H- 2 ) 



and the right-hand-side converges to as i — > oo. 
Hence, we obtain by (11.2) that 

lim |1 - \\ui\\ ki \ < lim (|1 - ||u*|| fc .| + \\u* - Ui\\ ki ) = |1 - ||u*|| | = 0. 

i— >oo i— >oo 

But this clearly contradicts (11.1). □ 

Corollary 11.2. If k G N is sufficiently large so that r]k(M) < 1/2, then for all m G M, we have 

2„ „ „ , 

gll^llfc ^ \\ m \\ < 2 1| 777. || fc . (11-3) 

Assume now that for a given / G X, with unique best approximant Pm(/) £ non-classical 
strong uniqueness holds. That is, we assume there exists a nonnegative strictly increasing function 
4> (depending only on / and M), defined on [0, <r], such that for all m G M satisfying ||/ — m|| — 
11/ -PmU) II <<7> we have 

||/ - m|| - ||/ - P M (/)|| > 0(||m - Pm(/)||). (11.4) 

Note that the above relation yields that <p(t) < t on [0, a]. Moreover, if <p(t) > ct thereon then 
classical strong uniqueness holds at /. Using Lemma 11.1 and (11.4), we will estimate the deviation 
of PjjPif) from Pm(J)- In what follows, r/fe(/+M) will be an abuse of notation for ^(spanj/, M}). 
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Proposition 11.3. Assume that M is a finite dimensional subspace in X and a strong uniqueness 
type estimate of the form (11.4) holds for a given f G X. Then for any best approximant Pffif) 
and any k sufficiently large so that n k (f + M) < 1/2, we have 

cf>(\\P^\f)-P M (f)\\)<8\\f\\ Vk (f + M). 

In particular, 

lim \\PfiHf)-P M (f)\\=0. 

k — >oo 

Proof: From (11.3) applied to both Pff(f) and /, and since \\P^\f)\\ k < 2||/|| fc , we have 

11/ - P£\f)\\ < ll/ll + \\P { M ] (f)\\ < 11/11 + 2||P£ ) (/)|| fe < 11/11 + 4||/|U < 7||/||. 
From the definition of n k (f + M) and the above estimate, we have 

I 11/ " P { m\I)\\ " 11/ " Pi?(/)IW < 11/ - P$ (f)\\Vk(f + M)< 7||/||%(/ + M). 

Thus 

\\f-P ( M\f)\\ < ||/-Pi fc) (/)IU + 7||/|k(/ + M) < ||/-P M (/)|| fc + 7 ||/||r ?fc (/ + M) 

< 11/ - PmU)\\ + 11/ - PM(f)\\Vk(f + M) + 7\\f\\vk(f + M) 

< \\f-PM(f)\\+8\\f\\ Vk (f + M). 

Combining now the last estimate with (11.4) immediately yields the desired statement. The limit 
follows from Lemma 11.1 since 



lim n k (f + M) = 

k— >oo 



and (f>(0) = while <f>(t) > for t > 0. □ 



Remark. The above proposition yields the general estimate 

\\P { M\f) ~ PmU)\\ < rHcmif + M)), (11.5) 

where, by Lemma 11.1, the quantity on the right hand side tends to as k — > oo. In order to use 
this estimate to obtain rates of convergence, one needs to find sharp bounds for both rj k and (f>. In 

(k) 

many instances, this leads to sharp estimates for the deviation of P M (/) from Pm(/) (for example, 
discrete Chebyshev approximation, Polya algorithm). However, in some cases, this approach does 
not yield sharp bounds, even when both r] k and (j) are precisely determined (this happens, for 
example, in the case of discrete Li-approximation). In what follows, we provide a brief summary 
of these results. 



12. Discretization of Norms 



It is considerably easier to solve an approximation problem when the L p -norm is replaced by a 
discrete L p -norm. In this section, we shall investigate the size of the error of this discretization 
technique. 



Strong Uniqueness 75 

Let us denote by 

\\f\\ k := max \f(j/k)\ 

0<j<k 

and 

fc-i 



T^(Ei^'A)n 1/p , i<p<oo 



k ' p ■'- fci/p 

the discrete uniform and L p -norms on [0, 1], respectively. 

In what follows, we deal with the example M = U n . We first consider the case of the discrete 
uniform norm. 

Theorem 12.1. Let / G C 2 [0, 1] and M = U n . Set X := C[0, 1], with the usual uniform norm 
thereon, and 

H/IU := max \f(j/k)\. 

0<j<fe 

Then there exists a constant C such that 

m(f + n„) < (C||/"|| + 4n 4 (l + C||/||))/(8A; 2 ) 

for all k, and therefore 

\\p^(f)-p M (f)\\ = o(k- 2 ). 

Proof: We know from preceding results that classical strong uniqueness holds in this case, i.e., 
we can use estimate (11.5) with (f)(t) = "f n {f)t, where 7„(/) is the strong unicity constant for the 
function /. We now need to estimate f +Il n ) for / G C 2 [0, l]\n„. Let g £ S(f+U n ) and assume 
that ||g|| = g{x*) = 1, x* G [0, 1]. If x* = j/k for some j G {0, 1, ... , k}, then ||<7|| = \\g\\k = 1- 
Thus, we may assume that x* ^ j/k, j G {0,1, ... ,k}, i.e., in particular x* G (0, 1) and g'(x*) = 0. 
Using Taylor's formula, we have that for every x G [0, 1], 

g( X )=g( X *) + ^l( X - X *)l (12.1) 

with some £ between X and X * . Obviously, we can choose a j G {0, 1, . . . , k} so that \ X *—j/k\ < l/2k. 
Setting x = j/k in (12.1) yields 

i = \\g\\<\\9h + ^y% 

Finally, g = af + p n for some p n G II ra . As ||<7|| = 1 and / ^ U n , it easily follows that \a\ < C = 
[min pe n„ 11/ — Thus < 1 + C||/||. Therefore by the Markov inequality, 

||5"||<C||r||+4n 4 (l + C||/||). 

From these estimates, we obtain 

m(f + n„) < (C\\f" || + 4n 4 (l + C\\f\\))/(8k 2 ). 
In particular, rjk(f + II n ) = 0(l/k 2 ), and hence by (11.5), we have 

\\p£\f)-PM(f)\\ = o(k- 2 ). □ 

The above estimate turns out to be sharp, in general. 

Let us now turn our attention to the discrete L p -norms, 1 < p < oo. 
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Theorem 12.2. Let / G C^O, 1] and M = II n . Set X := L p [0, 1], 1 < p < oo, and 



fc-i 



: =t^(£i/(^)I p ) 1/p - 



i=o 



Tien there exist constants C p and c p such that 
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m(f + n„) < V c p -{\ + c p ||/|| p + c p \\f\\ p ) 



for all k, and therefore 



P { M ] (f) ~ PmU)\\p = 0(k- 9 *), 1< p < oo, 



where 8 p : = min{l/2, 1/p}. 

Proof: Consider / £ C^fO, 1] and let us estimate %(/ + IT n ) from above. Setting := [j/k, (j + 
l)/k], we obtain, by repeated application of Holder's inequality, for any g G C^O, 1], 



I? - llsllL 



1 



fe-1 



1 fe-1 

|^)rdx--^b(jA)| 



i=o 



k— 1 fc— 1 / \ 

E/ (i^)i p -b(iA)i p )dx < W [/ piy(orV(oi<if 

j=0 J &k j=O jA k yj/k ) 



1-1/ p 



1-1/p 



\g'm p ^ 



1/p 



fe-1 . 

E / i^)r 



1/p 



Thus 



IS " \\9\\l, P 



^IWgFp-'WWp 



IhWp-'yWp- 



which in turn implies 



\\g\\p - \\9\\k 



< fll<7%- 



(12.2) 



Let now g := af + p n E S(f + U n ). As ||p|| p = 1, it follows, as above, that \a\ < C p = 
[min pe n„ 11/ — p||p] _1 - Thus \\p n \\p < 1 + Cp||/||p and using the L p -Markov inequality, we obtain 
HKJIp < c p n 2 (l + Cp||/||p). Combining this estimate with (12.2) yields 



n 



Vk(f + n n ) < pCp -(i + c p ||/|| p + c p \\f\\ p ). 



Thus, we can use the estimate 



%(/ + n n ) = 0(fc" 1 ), l<p<oo, 



(12.3) 
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in (11.5). Recall that by Corollary 7.2 and the subsequent Remark in the case of the L p -norm, 
1 < p < oo, we have the estimate <t)~ l {t) = O(t 0p ) where 6 P := min{l/2, l/p}. Thus, we obtain by 
(11-5), 

\\P£\f) - PmU)\\ p = 0(k~ e '), 1< p < oo. □ 

Remark. When p = 1, Theorem 9.2 implies that for / G C^fO, 1], non-classical strong uniqueness 
holds with (j)(t) = ct 2 . Thus repeated application of (12.3) and (11.5) yields 

\\PS\f)-PM(f)\\l=0(k-^). 

When p = 1, it is shown in Kroo [1981b] that if / G C 2 [0, 1] and / — Pm(/) has a finite number of 
zeros then the above estimate can be replaced by the sharp upper bound 

\\P^(f)-PM(f)h=0(k-l). 

13. Asymptotic Representation of Weighted Chebyshev Polynomials 

In this section, we shall use strong uniqueness results in order to solve approximation problems 
in the case where the norm is altered by a weight function. This approach will be used to derive 
asymptotic representations for weighted Chebyshev polynomials. Let w > be a positive weight 
on [—1, 1] and denote by 

T n>p (x,w) := x n + q n -i(x), q n -i G Il n _i, 

the monic polynomial that deviates least from in the weighted L p -norm on [—1,1]. That is, 

\\T ntP (x,w)w\\ p = inf{||(x n + g„-i(x))w\\ p : g n -i G n„_i}, 1 < p < oo. 

In addition, let T* p (x,w) := T n;P (x, w)/\\T ntP (x, w)w\\ p be the normalized Chebyshev polynomial. 
The problem of finding asymptotic representations for weighted Chebyshev polynomials has been 
studied by many authors, but satisfactory solutions were given only in the case p = 2; see Bern- 
stein [1930]. In addition, in the case p = oo, Bernstein [1930] gave an asymptotic formula for 
\\T nj00 (x,w)'w\\ 00 , Fekete, Walsh [1954/55] found the n-th root asymptotics of these Chebyshev 
polynomials, while in a more recent paper, Lubinsky, Saff [1987] gave their asymptotics outside 
of [—1, 1]. In this section, based on results of Kroo, Peherstorfer [2007], [2008], we shall outline a 
complete solution to this classical problem. This solution will be based on the strong uniqueness 
estimates derived in the previous sections. Another basic tool consists of the fact that for the spe- 
cific weight w = l/p m , p m > 0, p m G n m , m < n, an explicit formula for the minimal polynomials 
was already found by Chebyshev (see Akhiezer [1947]). 

The case p = oo. It is known (see Akhiezer [1947] ) that 

T n * >oo (cos0, l/ Pm ) = M{z- n g 2 m (z)}, z = (13.1) 

where p m G II m is a polynomial positive on [—1, 1] and g m G Il m is the certain polynomial (unique 
up to a multiplicative constant of modulus 1) all of whose zeros lie in \z\ > 1 and such that 



|Sm(e i0 )| 2 = p m (cos0), 4>e[0,n}. 
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Moreover, the corresponding Li-Chebyshev polynomial is given by 

lS_ 1 , 1 (cos<M/p ro ) = -%{z- n g 2 m (z)}/sm^ z = e i *. 

In addition, these polynomials satisfy the relation 

K{.-^)} 2 + -s{z- n gl{z)} 2 = p 2 m . (13.2) 

It follows immediately from (13.2) that the zeros of ^s{z~ n g^ n {z)} are the n + 1 equioscillation 
points of T* ^(cos <p, \/p m ). Moreover, in view of the alternation theorem (Theorem 4.4), (13.2) 
also implies that 

r*_ liOO (cos0,Vl-x 2 /Pm) = %{z- n g 2 m (z)}/smd>. (13.3) 

So a natural idea is to approximate a positive weight w by reciprocals of positive polynomials l/p m 
(the degree of approximation can be estimated by the Jackson Theorem), and then apply strong 
uniqueness type results in order to obtain the required asymptotics. Clearly, we shall need rather 
precise strong uniqueness type estimates that also take into account the dependence upon n. 

We first recall that the positive continuous weight function tu(cos 0) can be represented in the 

form 

™ (cos0) = Re^yp' 

where n(z), the so called Szego function of w, is the function nonzero and analytic in \z\ < 1 given 
by the formula 

f>27T 



n(z) :=exp|-i-^ i_±|l ogw (cos^)d0|. 



Let us denote by C k+a [— w, it] the class of 27r-periodic functions whose A;-th derivative, k G N, 
satisfies the Lip a property. This next result provides an asymptotic formula for the weighted 
Loo-Chebyshev polynomials for positive weights w(coscp) G C 2+a [— it, tt]. 

Theorem 13.1. Let w(cos<p) G C 2+a [-ir, tt] with < a < 1, w(x) > 0, x G [-1, 1]. Then 

T^cos^w) = M{e-' in H<^)) 2 } + 0(n- a ) 

uniformly for <f> G [0, it] . 

First, we shall need a lemma estimating the deviation between Chebyshev polynomials corre- 
sponding to different weights via the strong unicity constant. 

Lemma 13.2. Let w\,w 2 G C[— 1,1] be positive weight functions. Then 

Q 

ll T n,oo(-^l)^l - T n,oo(-,W 2 )w 2 \\ < . ||wi-W 2 ||, 

ln-l{ 1 n,oo\ UJ l)) 

where r y n -i(T* 00 (wi)) is the strong unicity constant ofT* 00 (wi)wi with respect to the Haar space 
wiU n -i, and c > depends only on wi,w 2 . 

Proof: We shall denote below by cj constants depending only on w\,w 2 . Clearly, 

T n,oo(-,^i) = a n x n -pi =: T*, T* tOC (-,w 2 ) = b n x n - p 2 =: T 2 * , p 1 ,p 2 eU n _ 1 . 
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Denote by Tl := T 1 */a ra ,T 2 := T 2 */6 n the corresponding monic polynomials. Then using the 
extremal property of Ti, we obtain 



— = \\TiwA < \\T2wA < 
a n 



T 2 w 2 — 

W2 



1 

< — 



W2 



Replacing Ti by T 2 , we can obtain an analogous inequality 



1 1 

— < — 



W 2 
Wl 



Thus combining these estimates, we have 



1 a u 

1 — Cl Wl — W2\\ < < - — < 



w 2 



Wl 



< C2\\W1 — W2W + 1. 



It therefore follows that 



< c 3 \\wi - w 2 \ 



(13.4) 



Set now 



q:=Pi- -7-P2 = T X - —T 2 G II n _i. 

On On 



(13.5) 



Since is the best approximant to ttJiTj* in wiU n -i in the uniform norm, it follows by the classical 
strong uniqueness inequality (applied to the Haar space wiH n -i ) together with (13.4) and (13.5) 
that 



7n _i(T 1 *)||(7«;i|| < \\(T*-q)w 1 \\-\\T*w 1 \\ = -JL\\T; Wl \\ - \\T* Wl \ 



a n Mm* 11 1 ^ a r 
n r , 



w 2 



-1 < -^(1 + c 4 \\ Wl - w 2 \\) - 1 



(13.6) 



< (1 + c 3 \\wi - w 2 \\)(l + c 4 ||u;i - 102 1|) - 1 < c 5 ||«;i — k^H, 



where 7 n _i(T 1 *) is the strong unicity constant of T^w\ with respect to the Haar space w\Tl n -i. 
Finally, using (13.4) and (13.6), we arrive at 



\\T^w 1 - T 2 *ttJ 2 || < \\qwi\\ + ||T 2 *(tu 2 - -t^wi 



< \\qwi\\ + ||T 2 *(u; 2 - Wl )\\ + ||T>! (l - ^ || < 



C6 



7n-i(rf) 



\W\ - w 2 \\, 



which is the required estimate. □ 
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We now need to estimate the strong unicity constant 7„_i(T* ^(^i)) (which is the strong 
unicity constant of T* ^{wijwi with respect to the Haar space w\Tl n -i) in the special case when 
w\ = 1/pm- Moreover, it is important in deriving asymptotic relations that the dependence of this 
quantity on n is revealed. We shall use Theorem 4.10 of Section 4 in order to provide a precise 
bound. 

Lemma 13.3. Let p m G II m , m < n, be such that < A < p m < B on [—1,1]. Then 

7„_i(T^(.,l/p m ))>-^, (13.7) 
where the constant c depends only on A and B. 

Proof: Let — 1 = x\ < ■ ■ ■ < x n+1 = 1 be the equioscillation points of T* oc (-, l/p m )/pm and 
"ifc G II n _i, 1 < fe < n + 1, be the polynomials of Theorem 4.10 with respect to the Haar space 
n n _i/p m , i.e., 

rrikix^/pmixj) = sgnT* i00 ( Xj , l/p m ), l<j<n + l, j ^ k. 
By the second statement of Theorem 4.10, 

ln-i{T* l/p m )) = mm (13.8) 

l<fe<n+l \\mk/Pm\\ 

Set now for any 1 < k < n + 1, 



Q k (x) := (x - x k )T* t00 (x, l/p m ) + (1 - x 2 )T*_ hoo (x, \J 1 - x 2 /p m ). 

It follows by (13.2) and (13.3) that the equioscillation points of T* tQO (x,l/ p m )/ p m are the zeros 
of (1 — x 2 )T*_ 1 ^ QO (x, \/l — x 2 / p m ). Moreover, since m < n, the relation (13.2) also implies that 
the leading coefficients of T* tQO (x, l/p m ) and T*_ l oo (x, vT— x 2 /p m ) coincide. It therefore follows 
that Qk G n n , Qk{xk) =0, 1 < k < n + 1, and 

mk ( x ) = !M°L j l<fc<n + l. 

X Xfc 

Obviously, ||Qfc|| < c, 1 < A; < n + 1, with a constant depending only on A, B. This easily implies, 
by the Markov inequality, that ||mfc|| < cin 2 , 1 < k < n + 1. Applying this inequality together 
with (13.8) yields (13.7). □ 

Proof of Theorem 13.1: When < a < 1, the smoothness condition imposed on the weight 
w implies that its Szego function n(z) is also C 2+a on \z\ = 1 (see Kroo, Peherstorfer [2008] for 
details). Therefore there exists a sequence of polynomials g m G II m such that uniformly on \z\ < 1 

\7r(z)-g m (z)\<-^. (13.9) 

Since tt(z) / on \z\ < 1, it follows that for m > m the function g m also does not vanish in \z\ < 1 
and 

Pm (cos0) := \g m {^)\ 2 > c> 0. (13.10) 
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Furthermore, by Chebyshev's result (see (13.1)), 

T n * ^(cos 0, l/p m ) = z = e^. (13.11) 

We are now ready to verify the statement of the theorem. Indeed, since 

U;(C ° S0) = |7r(et)P ' 

we have by (13.9) and (13.10) that 



w(x) 



<-^, *€[-!,!]. (13.12) 



Pm (#) 

Thus using Lemmas 13.2 and 13.3 together with (13.9), (13.10) and (13.12), see also (13.11), we 
obtain 

llr^^os^^-^e--^^)) 2 }!! < ||r* i00 (cos^«;)-T* i00 (cos^l/p m )|| + 

||T,: i00 (cos^l/p m )-»{e- i ^(7r(e i ^) 2 }|| < x = cos0. 

Putting in the last estimate m := [n/2\ yields the statement of the theorem. □ 

The case 1 < p < oo. Similarly to the case p = oo, the following explicit representation 
for weighted Chebyshev polynomials is given in Akhiezer [1947] for the weight function p m ^ p := 
7=f p/2 . where p m G II m is positive on [-1,1], 

V -L # Pm 

T* p {cos<j), Pm , v ) = \ p ^{z- n g m (z)}, z = e^, 
where \ p is a constant depending only on p and, as above, g m is related to p m by 

|Sm(e i0 )| 2 = p m (cos<p), 4>e[0,ir\. 

p/2 

Let now to be a positive weight and set w p := J' 1 _ x i , 1 < p < oo. 

The next theorem provides an asymptotic representation for L p -Chebyshev polynomials when 
1 < p < oo. 

Theorem 13.4. Let 1 < p < oo. Then for any positive weight w such that w G C a [— ir,iv], 
< a < 1, 

T n ycos^,u; p ) = A p K{z-"tt(z)} + 0(n" Q ^), z = e 1 *, 

wiiere n(z) is the Szego function of w, P := min{l/2, 1/p}, and the O(-) is taken with respect to 
the L p -norm. 

Since the main idea in the proof of Theorem 13.4 is similar to the proof of Theorem 13.1, we 
shall just briefly outline the proof. First, it is shown that, similarly to Lemma 13.2, under suitable 
conditions on the weight functions wi, W2 > A > 0, we have 

||T* iP (cos0,u;i) - T^ p (cos</>, w 2 )\\ p < c A , P \\wi - w 2 ||p p - 

It should be noted that instead of classical strong uniqueness, we use here the fact that in L p - 
approximation with 1 < p < oo, non-classical strong uniqueness with </> _1 (t) = 0(t 9p ) is satisfied, 
and the constants involved depend only on p. 

Finally, similar asymptotic relations can also be given when p = 1. In this case, non-classical 
strong uniqueness with </> _1 (i) = 0(t 1 / 2 ) takes place (this follows from Theorem 9.2 applied in the 
case of Lip 1 functions). In addition, similarly to Lemma 13.3, one has to study how the constants 
involved depend on n. See Kroo, Peherstorfer [2008] for details. 
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